Stabilized coronavirus spike (S) protein immunogens and related vaccines

ABSTRACT

The present invention provides redesigned soluble coronavirus S protein derived immunogens that are stabilized via specific modifications in the wildtype soluble S sequences. Also provided in the invention are nanoparticle vaccines that contain the redesigned soluble S immunogens displayed on self-assembling nanoparticles. Polynucleotide sequences encoding the redesigned immunogens and the nanoparticle vaccines are also provided in the invention. The invention further provides methods of using the vaccine compositions in various therapeutic applications, e.g., for preventing or treating coronaviral infections.

CROSS-REFERENCE TO RELATED APPLICATIONS

The subject patent application is a continuation of U.S. patentapplication Ser. No. 17/019,825 (filed Sep. 14, 2020; now pending),which claims the benefit of priority to U.S. Provisional PatentApplication No. 63/045,557 (filed Jun. 29, 2020). The full disclosuresof the priority applications are incorporated herein by reference intheir entirety and for all purposes.

STATEMENT OF GOVERNMENT SUPPORT

This invention was made with government support under grant numbersAI139092 and AI137472 awarded by the National Institutes of Health. Thegovernment has certain rights in the invention.

BACKGROUND OF THE INVENTION

Coronaviruses (CoV) are enveloped viruses with a positive-stranded RNAgenome. In 2002, there was an outbreak of severe acute respiratorysyndrome (SARS) in Asia. In 2003, a novel coronavirus was identified tobe the causative agent of SARS and subsequently named SARS-CoV. Duringthe 2002-2003 outbreak, SARS-CoV infected over 8000 people with ˜10%fatality rate. In 2012, another coronavirus, Middle East respiratorysyndrome coronavirus (MERS-CoV), was identified. Since 2012, MERS-CoVhas infected over 2000 people in 27 countries with ˜35% fatality rate.In December 2019, a novel coronavirus designated as 2019-nCoV (orSARS-CoV-2) appeared in Wuhan, China. The first reported infectedindividuals, some of whom showed symptoms as early as December 8, werediscovered to be among stallholders from the Wuhan South China SeafoodMarket. On Jan. 10, 2020, gene sequencing determined that this novelcoronavirus, a β-coronavirus, is related to the MERS-CoV and theSARS-CoV. On Jan. 30, 2020, the WHO declares SARS-CoV-2 a public healthemergency of international concern (PHEIC), and on Mar. 11, 2020,characterized the situation as a pandemic. On May 24, 2020, the WHOCoronavirus Disease (COVID-19) Dashboard showed a total of 5,304,772confirmed cases in 216 countries, areas or territories, including342,029 deaths. SARS-CoV, MERS-CoV, and SARS-CoV-2 belong to theβ-coronavirus genus and are highly pathogenic zoonotic viruses. Inaddition to these three highly pathogenic β-coronaviruses, fourlow-pathogenicity β-coronaviruses, HCoV-OC43, HCoVHKU1, HCoV-NL63 andHCoV-229E, are also endemic in humans.

To date, no therapeutics or vaccines have been approved for treating orpreventing any human-infecting coronaviruses. There is a strong andurgent need in the art for effective vaccines against coronaviruses. Thepresent invention is directed to this and other pressing needs in theart.

SUMMARY OF THE INVENTION

In one aspect, the invention provides engineered immunogen polypeptidesthat are derived or modified from the spike (S) glycoprotein ofcoronaviruses including SARS-CoV, MERS-CoV and SARS-CoV-2. Relative to awildtype soluble S protein sequence of the coronavirus, the immunogenpolypeptides of the invention contain an altered soluble S sequence withmodifications that stabilize the prefusion S structure. In variousembodiments, the modifications include (a) a mutation that inactivatesthe S1/S2 cleavage site, and (b) a mutation in the turn region betweenthe heptad repeat 1 (HR1) region and the central helix (CH) region (seeFIG. 1 ) that prevents HR1 and CH to form a straight helix duringmembrane fusion process. In some embodiments, the immunogen polypeptidesof the invention also contain truncation of the heptad repeat 2 region(HR2) in addition to the stabilizing mutations noted above.

Some soluble S immunogen polypeptides of the invention are derived fromSARS-CoV-2. In some of these embodiments, the mutation inactivatingS1/S2 cleavage site can contain substitution of ⁶⁸²RRAR⁶⁸⁵ (SEQ IDNO:19) with GSAG (SEQ ID NO:20), and the mutation in the turn region cancontain double mutation K986G/V987G, K986P/V987P, K986G/V987P orK986P/V987G, using amino acid numbering based on cryo-EM model PDB ID6VSB as reference. In some embodiments, the wildtype soluble S sequencecontains the sequence shown in SEQ ID NO:14, or a substantiallyidentical or conservatively modified variant thereof. In someembodiments, truncation of HR2 entails deletion of the residues shown inSEQ ID NO:9 at the C-terminus of the wildtype soluble S sequence. Insome of these embodiments, the immunogen polypeptides can furtherinclude truncation of residues shown in SEQ ID NO:10 at the C-terminus.In some of these embodiments, the immunogen polypeptides containsubstitution of residues shown in SEQ ID NO:10 at the C-terminus of thewildtype soluble S sequence with residues GNS.

In some embodiments, the SARS-CoV-2 derived immunogen polypeptides ofthe invention can contain a N-terminal leader sequence shown in SEQ IDNO:15. In some embodiments, the immunogen polypeptide can furtherinclude in the region of HR1 that interacts with HR2 (a) one or moreproline or glycine substitutions, and/or (b) insertion of one or moreamino acid residues. In some of these embodiments, the immunogenpolypeptide can have one or more substitutions selected from A942P,S943P, A944P, A942G, S943G and A944G. In some of these embodiments, theinsertion can be insertion of G or GS between any residues in A942-A944.In some exemplified embodiments, the SARS-CoV-2 derived immunogenpolypeptides of the invention contain the sequence shown in any one ofSEQ ID NOs:32-37, or a substantially identical or conservativelymodified variant thereof.

Some soluble S immunogen polypeptides of the invention are derived fromSARS-CoV. In some of these embodiments, the mutation inactivating S1/S2cleavage site can be R667G substitution, and the mutation in the turnregion comprises double mutation K968G/V969G, K968P/V969P, K968G/V969Por K968P/V969G, using amino acid numbering based on UniProt ID P59594 asreference. In some embodiments, the wildtype soluble S sequence containsthe sequence shown in SEQ ID NO:7, or a substantially identical orconservatively modified variant thereof. In some embodiments, theSARS-CoV derived immunogen polypeptides of the invention containtruncation of HR2 (SEQ ID NO:9) at the C-terminus of the wildtypesoluble S sequence. In some of these embodiments, immunogen polypeptidescan additionally include truncation of residues shown in SEQ ID NO:10 atthe C-terminus. In some of these embodiments, the immunogen polypeptidescontain substitution of residues shown in SEQ ID NO:10 at the C-terminusof the wildtype soluble S sequence with residues GNS.

In some embodiments, the SARS-CoV derived immunogen polypeptides of theinvention can contain a N-terminal leader sequence shown in SEQ ID NO:8.In some embodiments, the immunogen polypeptides can further include inthe region of HR1 that interacts with HR2 (a) one or more proline orglycine substitutions, and/or (b) insertion of one or more amino acidresidues. In some of these embodiments, the immunogen polypeptide canhave one or more substitutions selected from S924P, T925P, A926P, S924G,T925G, and A926G. In some of these embodiments, the insertion can beinsertion of G or GS after any residue in S924-A926.

Some other soluble S immunogen polypeptides of the invention are derivedfrom MERS-CoV. In some of these embodiments, the mutation inactivatingS1/S2 cleavage site can contain R748A/R751G double mutation, and themutation in the turn region comprises double mutation V1060G/L1061G,V1060P/L1061P, V1060G/L1061P or V1060P/L1061G, using amino acidnumbering based on UniProt ID R9UQ53 as reference. In some embodiments,the wildtype soluble S sequence contains the sequence shown in SEQ IDNO:11 or a substantially identical or conservatively modified variantthereof. In some embodiments, MERS-CoV derived immunogen polypeptides ofthe invention contain truncation of HR2 (SEQ ID NO:13) at the C-terminusof the wildtype soluble S sequence.

In some embodiments, the MERS-CoV derived immunogen polypeptides of theinvention can contain a N-terminal leader sequence shown in SEQ IDNO:12. In some embodiments, the immunogen polypeptides can furtherinclude in the region of HR1 that interacts with HR2 (a) one or moreproline or glycine substitutions in the region of HR1 that interactswith HR2 in the region of HR1 that interacts with HR2 in the region ofHR1 that interacts with HR2, and/or (b) insertion of one or more aminoacid residues. In some of these embodiments, the immunogen polypeptidecan have one or more substitutions selected from T1013P, T1014P, T1015P,T1013G, T1014G and T1015G. In some of these embodiments, the insertioncan be insertion of residue G or GS after any residue in T1013-T1015.

In some embodiments, the coronavirus S protein derived immunogenpolypeptides of the invention can additionally include a trimerizationmotif at the C-terminus. In some of these embodiments, the trimerizationmotif is foldon or viral capsid protein SHP. In various embodiments, theemployed trimerization motif can contain the foldon sequence shown inSEQ ID NO:26 or the SHP sequence shown in SEQ ID NO:27, or asubstantially identical or conservatively modified variant thereof. Insome embodiments, the coronavirus S protein derived immunogenpolypeptides of the invention can additionally contain the subunitsequence of a self-assembling nanoparticle that is fused to the alteredsoluble S sequence. In some of these embodiments, C-terminus of thealtered soluble S sequence is fused to N-terminus of the nanoparticlesubunit sequence.

In another aspect, the invention provides polynucleotide sequences thatencode the coronavirus S protein derived immunogen polypeptidesdescribed herein. Some of the polynucleotide sequences encode a fusionpolypeptide containing the immunogen polypeptide that is fused at itsC-terminus to the N-terminus of the subunit sequence of aself-assembling nanoparticle.

In another aspect, the invention provides coronavirus vaccinecompositions that contain an immunogen polypeptide described herein thatis displayed on the surface of a self-assembling nanoparticle. In someof these embodiments, the self-assembling nanoparticle contains atrimeric sequence, and C-terminus of the immunogen polypeptide is fusedto N-terminus of the subunit sequence of the nanoparticle. In someembodiments, the employed self-assembling nanoparticle is composed offerritin, E2p or I3-01. Some nanoparticle vaccines of the inventiondisplay an engineered SARS-CoV-2 spike protein described herein.

In some embodiments, the nanoparticle vaccine contains (1) a polypeptidesequence containing from N terminus to C terminus (a) an engineeredSARS-CoV-2 spike polypeptide, a GS linker sequence, and nanoparticlesequence I3-01v9, (b) an engineered SARS-CoV-2 spike polypeptide, a GSlinker sequence, and nanoparticle sequence E2p, or (c) an engineeredSARS-CoV-2 spike polypeptide, a GS linker sequence, and nanoparticlesequence ferritin; or (2) a conservatively modified variant of thepolypeptide sequence. In some of these embodiments, the displayedSARS-CoV-2 spike immunogen polypeptide contains, relative to thewildtype spike sequence, (a) substitution of the S1/S2 cleavage site⁶⁸²RRAR⁶⁸⁵ (SEQ ID NO:19) with GSAG (SEQ ID NO:20), (b) double mutationsK986G/V987G in the turn region, and (c) truncation of HR2 (SEQ ID NO:9)at the C-terminus.

In some nanoparticle scaffolded SARS-CoV-2 vaccines of the invention,the displayed SARS-CoV-2 spike immunogen polypeptide contains thesequence shown in SEQ ID NO:33 or 34, or a conservatively modifiedvariant thereof. In some of these embodiments, the scaffolded vaccine iscomposed of (1) a subunit sequence containing from N terminus to Cterminus (a) the engineered SARS-CoV-2 spike polypeptide shown in SEQ IDNO:33, linker sequence (G₄S)₂ (SEQ ID NO:22), nanoparticle sequenceshown in SEQ ID NO:23 (I3-01v9), locking domain shown in SEQ ID NO:29(LD7), and T cell epitope shown in SEQ ID NO:30 (PADRE), (b) theengineered SARS-CoV-2 spike polypeptide shown in SEQ ID NO:33, linkersequence G₄S (SEQ ID NO:21), nanoparticle subunit sequence shown in SEQID NO:24 (E2p), locking domain shown in SEQ ID NO:28 (LD4), and T cellepitope shown in SEQ ID NO:30 (PADRE), or (c) the engineered SARS-CoV-2spike polypeptide shown in SEQ ID NO:33, linker sequence G₄S (SEQ IDNO:21), nanoparticle sequence shown in SEQ ID NO:25 (ferritin); or (2) aconservatively modified variant of the subunit sequence. In someembodiments, the subunit of the nanoparticle scaffolded vaccinescontains the sequence shown in any one of SEQ ID NOs:38-40, or asubstantially identical or conservatively modified variant thereof.

In still another aspect, the invention provides pharmaceuticalcompositions that contain the vaccine composition described herein, anda pharmaceutically acceptable carrier. In another aspect, the inventionprovides methods for preventing or treating a coronavirus infection in asubject. These methods involve administering to the subject apharmaceutically effective amount of a vaccine composition or apharmaceutical composition described herein.

A further understanding of the nature and advantages of the presentinvention may be realized by reference to the remaining portions of thespecification and claims.

DESCRIPTION OF THE DRAWINGS

FIG. 1 illustrates the organization of different structural motifs ofcoronaviral spike (S) protein. The scheme shown in the figure reflectsthe structure of S protein of different coronaviruses encompassed by theinvention, e.g., SARS-CoV, MERS-CoV and SARS-CoV-2. The structuraldomains and motifs of the S protein shown in the figure include RBD,HR1, CH1 and HR2 domains or regions, as well as the S2 cleavage site(aka S1/S2 cleavage site) and the S2′ cleavage site. In addition to thevarious S structural components indicated in the figure, the amino acidresidues between HR1 and CH are denoted “the turn region” herein.

FIG. 2 is a schematic representation of the mouse immunization protocol.Groups of five mice were immunized four times with three-week intervals.All vaccine antigens (50 ug/injection) were formulated with AddaVax, anoil-in-water emulsion adjuvant, except for I3-01v9, which was formulatedwith aluminum phosphate (AP). The injections were done through theintraperitoneal (IP) route. Blood samples were collected two weeks aftereach injection.

FIG. 3 shows results from SARS-CoV-2 vaccine-induced antibody responsesin mice. SARS-CoV-2 spike/spike-NP vaccine-induced binding antibodyresponse. Listed in the figure are a summary of ED₅₀ titers measured forfive SARS-CoV-2 spike-based vaccine groups (S2P-5GS-1TD0,S2GΔHR2-5GS-1TD0, S2GΔHR2-5GS-FR, S2GΔHR2-5GS-E2p-L4P, andS2GΔHR2-10GS-I3-01v9-L7P) against three coating antigens in ELISA. ED₅₀values were calculated in GraphPad Prism 8.4.3. Of note, the ED₅₀ valuesat w2 were derived by setting the lower/upper constraints of OD₄₅₀ at0.0/3.2 to achieve greater accuracy.

FIG. 4 shows additional results from SARS-CoV-2 vaccine-induced antibodyresponses in mice. Listed in the figure are a summary of ID₅₀ titersmeasured for five SARS-CoV-2 spike-based vaccine groups (S2P-5GS-1TD0,S2GΔHR2-5GS-1TD0, S2GΔHR2-5GS-FR, S2GΔHR2-5GS-E2p-L4P, andS2GΔHR2-10GS-I3-01v9-L7P) against two pseudoviruses, SARS-CoV-1-pp andSARS-CoV-2-pp, in neutralization assays. ID₅₀ values were calculated inGraphPad Prism 8.4.3, with the lower/upper constraints of %neutralization set at 0.0/100.0.

FIG. 5 shows results of SARS-CoV-2 vaccine-induced T-cell responses inmice. (A)-(B): Vaccine-induced CD4+ T cell immunity. Splenocytes derivedfrom mice at w11 were cultured in the presence of DC-pulsed with theS2PECTO spike (1×10-7 mM), E2p SApNP (1×10-7 mM) and I3-01v9 SApNP(1×10-7 mM) for 16 hours (A) and 4 hours (B), respectively. (C)&(D):Vaccine-induced CD8+ T cell immunity. Splenocytes derived from mice atw11 were cultured in the presence of DC-pulsed with the S2PECTO spike(1×10-7 mM), E2p SApNP (1×10-7 mM) and I3-01v9 SApNP (1×10-7 mM) for 16hours (C) and 4 hours (D), respectively. Splenocytes from five naïvemice were used as the control samples and cultured with PBS. Plots showthe frequencies of cell fraction. The P values were determined byone-way ANOVA analysis. *, P<0.05; **, P<0.01; ***, P<0.001.

DETAILED DESCRIPTION I. Overview

For SARS-CoV (aka SARS-CoV-1), MERS-CoV, and SARS-CoV-2, the viralgenome encodes spike (S), envelope (E), membrane (M), and nucleocapsid(N) structural proteins, among which the S glycoprotein is responsiblefor binding the host receptor via the receptor-binding domain (RBD) inits S1 subunit, as well as the subsequent membrane fusion and viralentry driven by its S2 subunit. A possible membrane fusion process hasbeen proposed. The receptor binding may help to keep the RBD in a‘standing’ state, which facilitates the dissociation of the S1 subunitfrom the S2 subunit. When the S1 subunit is dissociated from the S2subunit, a second S2′ cleavage can release the fusion peptide. Theconnecting region, HR1 region and central helix would form an extremelylong helix (≥200 Å) to insert the fusion peptide into the host cellmembrane. Finally, the HR1 and HR2 regions will form a coiled structureand assemble into a six-helix bundle to merge the viral and hostmembranes.

In all the prefusion S structures solved for SARS-CoV, MERS-CoV, andSARS-CoV-2, the viral membrane proximal HR2 region is invisible,indicating high mobility in HR2. The RBD contains a core subdomain and areceptor-binding motif (RBM). While the core subdomains are highlysimilar between the three coronaviruses, their RBMs are markedlydifferent, leading to different receptor specificity: SARS-CoV andSARS-CoV-2 recognize the angiotensin-converting enzyme 2 (ACE2), whereasMERS-CoV binds the dipeptidyl peptidase 4 (DPP4). As the S glycoproteinis surface-exposed and mediates entry into host cells, it is the maintarget of neutralizing antibodies (NAbs) upon infection and the focus ofvaccine design. S trimers are extensively decorated with N-linkedglycans that are important for proper folding and for modulatingaccessibility to NAbs.

The present invention is predicated in part on the studies undertook bythe inventors to design nanoparticle vaccines for three highlypathogenic β-coronaviruses, SARS-CoV, MERS-CoV, and SARS-CoV-2 based ontwo rational strategies. In the first strategy, the inventors aimed tostabilize the S trimer in a prefusion conformation by eliminating thecauses of metastability in various regions of S, particularly HR1 and inHR2, prior to displaying it on nanoparticles. In the second vaccinestrategy, the inventors utilized the SpyTag/SpyCatcher protein supergluesystem to create RBD-presenting nanoparticles. A number of S proteinderived immunogen polypeptides and nanoparticle vaccine constructs weregenerated based on the design and examined for activities.

As exemplified herein with SARS-CoV-2 (and SARS-CoV-1) spike protein,the engineered spike immunogen polypeptides of the invention are morestable and represent more optimal vaccine design relative to the controlpolypeptides devoid of the engineering. Their advantageous biochemicaland structural properties as described herein indicate that they areamenable for rapid and large-scale vaccine production in the industrialsetting. When examined in vivo, it was found that the engineeredSARS-CoV-2 spike immunogens (e.g., S2GΔHR2) are more effective than thenon-engineered control protein to elicit potent anti-SARS-CoV-2 NAbresponses, alone or presented on self-assembling nanoparticle platforms(SApNPs). As detailed in the Examples herein, the exemplifiednanoparticle vaccines of the invention, e.g., S2GΔHR2-presenting I3-01v9SApNP, can also elicit a strong Th1 response as well as other types ofT-cell response needed for protective cellular immunity. Resultsobtained from the exemplified studies herein on the SARS-CoV-2 spikeprotein indicate that the engineered spike immunogen polypeptides of theinvention provide more effective next-generation vaccine candidates forevaluation in human trials.

The invention provides coronavirus immunogens and vaccine compositionsin accordance with the studies and exemplified designs described herein.Related polynucleotide sequences, expression vectors and pharmaceuticalcompositions are also provided in the invention. In various embodiments,stabilized S trimers and RBD proteins, in the forms of protein ornucleic acid (DNA/mRNA) carried by a viral vector can be used ascoronavirus vaccines. In addition, nanoparticles presenting stabilized Strimers and RBDs can be used as VLP-type coronavirus vaccines.

The coronavirus S-protein based immunogens and vaccines of the inventionhave several advantageous properties. The S trimer designs describedherein, which present conserved neutralizing epitopes in theirnative-like conformation, enable S trimers to be used as vaccineantigens or displayed multivalently on nanoparticles. Nanoparticlevaccines of the invention allows S trimers derived from the threedifferent coronaviruses to be displayed on well-known nanoparticleplatforms, such as ferritin, E2p, and I3-01 with a size ranging from12.2 to 25.0 nm. In addition, the use of high-stability hollow nanocages(E2p and 1VLW/I3-01 variants allows engineering of locking domains (LD),T-cell epitopes (e.g. PADRE), and peptide adjuvants within the nanocage,thus providing an all-in-one vaccine solution. All S trimer-presentingnanoparticles can be produced in ExpiCHO cells with high yield. SinceCHO is one of the principal mammalian cell lines used for industrialmanufacture of protein therapeutics and vaccines and ExpiCHO is atransient version of this CHO cell line, nanoparticles obtained from theExpiCHO production are expected to have the same properties as thosefrom industrial CHO production. Moreover, the high yield of S-presentingnanoparticles produced in ExpiCHO cells with antibody and SECpurification a will enable the development of a simple, robust, andcost-effective manufacturing process for industrial production.

Unless otherwise specified herein, the vaccine immunogens of theinvention, the encoding polynucleotides, expression vectors and hostcells, as well as the related therapeutic applications, can all begenerated or performed in accordance with the procedures exemplifiedherein or routinely practiced methods well known in the art. See, e.g.,Methods in Enzymology, Volume 289: Solid-Phase Peptide Synthesis, J. N.Abelson, M. I. Simon, G. B. Fields (Editors), Academic Press; 1stedition (1997) (ISBN-13: 978-0121821906); U.S. Pat. Nos. 4,965,343, and5,849,954; Sambrook et al., Molecular Cloning: A Laboratory Manual, ColdSpring Harbor Press, N.Y. (3^(rd) ed., 2000); Brent et al., CurrentProtocols in Molecular Biology, John Wiley & Sons, Inc. (ringbou ed.,2003); Davis et al., Basic Methods in Molecular Biology, ElsevierScience Publishing, Inc., New York, USA (1986); or Methods inEnzymology: Guide to Molecular Cloning Techniques Vol. 152, S. L. Bergerand A. R. Kimmerl Eds., Academic Press Inc., San Diego, USA (1987);Current Protocols in Protein Science (CPPS) (John E. Coligan, et. al.,ed., John Wiley and Sons, Inc.), Current Protocols in Cell Biology(CPCB) (Juan S. Bonifacino et. al. ed., John Wiley and Sons, Inc.), andCulture of Animal Cells: A Manual of Basic Technique by R. Ian Freshney,Publisher: Wiley-Liss; 5th edition (2005), Animal Cell Culture Methods(Methods in Cell Biology, Vol. 57, Jennie P. Mather and David Barneseditors, Academic Press, 1st edition, 1998). The following sectionsprovide additional guidance for practicing the compositions and methodsof the present invention.

Unless otherwise noted, the expression “at least” or “at least one of”as used herein includes individually each of the recited objects afterthe expression and the various combinations of two or more of therecited objects unless otherwise understood from the context and use.The expression “and/or” in connection with three or more recited objectsshould be understood to have the same meaning unless otherwiseunderstood from the context.

The use of the term “include,” “includes,” “including,” “have,” “has,”“having,” “contain,” “contains,” or “containing,” including grammaticalequivalents thereof, should be understood generally as open-ended andnon-limiting, for example, not excluding additional unrecited elementsor steps, unless otherwise specifically stated or understood from thecontext.

Where the use of the term “about” is before a quantitative value, thepresent invention also includes the specific quantitative value itself,unless specifically stated otherwise. As used herein, the term “about”refers to a ±10% variation from the nominal value unless otherwiseindicated or inferred.

Unless otherwise noted, the order of steps or order for performingcertain actions is immaterial so long as the present invention remainoperable. Moreover, two or more steps or actions may be conductedsimultaneously.

Unless otherwise noted, the use of any and all examples, or exemplarylanguage herein, for example, “such as” or “including,” is intendedmerely to illustrate better the present invention and does not pose alimitation on the scope of the invention. No language in thespecification should be construed as indicating any non-claimed elementas essential to the practice of the present invention.

II. Definitions

Unless defined otherwise, all technical and scientific terms used hereinhave the same meaning as commonly understood by those of ordinary skillin the art to which this invention pertains. The following referencesprovide one of skill with a general definition of many of the terms usedin this invention: Academic Press Dictionary of Science and Technology,Morris (Ed.), Academic Press (1^(st) ed., 1992); Oxford Dictionary ofBiochemistry and Molecular Biology, Smith et al. (Eds.), OxfordUniversity Press (revised ed., 2000); Encyclopaedic Dictionary ofChemistry, Kumar (Ed.), Anmol Publications Pvt. Ltd. (2002); Dictionaryof Microbiology and Molecular Biology, Singleton et al. (Eds.), JohnWiley & Sons (3^(rd) ed., 2002); Dictionary of Chemistry, Hunt (Ed.),Routledge (1^(st) ed., 1999); Dictionary of Pharmaceutical Medicine,Nahler (Ed.), Springer-Verlag Telos (1994); Dictionary of OrganicChemistry, Kumar and Anandand (Eds.), Anmol Publications Pvt. Ltd.(2002); and A Dictionary of Biology (Oxford Paperback Reference), Martinand Hine (Eds.), Oxford University Press (4^(th) ed., 2000). Furtherclarifications of some of these terms as they apply specifically to thisinvention are provided herein.

As used herein, the terms “antigen” or “immunogen” are usedinterchangeably to refer to a substance, typically a protein, which iscapable of inducing an immune response in a subject. The term alsorefers to proteins that are immunologically active in the sense thatonce administered to a subject (either directly or by administering tothe subject a nucleotide sequence or vector that encodes the protein) isable to evoke an immune response of the humoral and/or cellular typedirected against that protein. Unless otherwise noted, the term “vaccineimmunogen” is used interchangeably with “protein antigen” or “immunogenpolypeptide”.

The term “conservatively modified variant” applies to both amino acidand nucleic acid sequences. With respect to particular nucleic acidsequences, conservatively modified variants refer to those nucleic acidswhich encode identical or essentially identical amino acid sequences, orwhere the nucleic acid does not encode an amino acid sequence, toessentially identical sequences. Because of the degeneracy of thegenetic code, a large number of functionally identical nucleic acidsencode any given protein. For polypeptide sequences, “conservativelymodified variants” refer to a variant which has conservative amino acidsubstitutions, amino acid residues replaced with other amino acidresidue having a side chain with a similar charge. Families of aminoacid residues having side chains with similar charges have been definedin the art. These families include amino acids with basic side chains(e.g., lysine, arginine, histidine), acidic side chains (e.g., asparticacid, glutamic acid), uncharged polar side chains (e.g., glycine,asparagine, glutamine, serine, threonine, tyrosine, cysteine), nonpolarside chains (e.g., alanine, valine, leucine, isoleucine, proline,phenylalanine, methionine, tryptophan), beta-branched side chains (e.g.,threonine, valine, isoleucine) and aromatic side chains (e.g., tyrosine,phenylalanine, tryptophan, histidine).

Epitope refers to an antigenic determinant. These are particularchemical groups or peptide sequences on a molecule that are antigenic,such that they elicit a specific immune response, for example, anepitope is the region of an antigen to which B and/or T cells respond.Epitopes can be formed both from contiguous amino acids or noncontiguousamino acids juxtaposed by tertiary folding of a protein.

Effective amount of a vaccine or other agent that is sufficient togenerate a desired response, such as reduce or eliminate a sign orsymptom of a condition or disease, such as pneumonia. For instance, thiscan be the amount necessary to inhibit viral replication or tomeasurably alter outward symptoms of the viral infection. In general,this amount will be sufficient to measurably inhibit virus (for example,SARS-CoV-2) replication or infectivity. When administered to a subject,a dosage will generally be used that will achieve target tissueconcentrations that has been shown to achieve in vitro inhibition ofviral replication. In some embodiments, an “effective amount” is onethat treats (including prophylaxis) one or more symptoms and/orunderlying causes of any of a disorder or disease, for example to treata coronavirus infection. In some embodiments, an effective amount is atherapeutically effective amount. In some embodiments, an effectiveamount is an amount that prevents one or more signs or symptoms of aparticular disease or condition from developing, such as one or moresigns or symptoms associated with coronaviral infections.

Unless otherwise noted, a fusion protein is a recombinant proteincontaining amino acid sequence from at least two unrelated proteins thathave been joined together, via a peptide bond, to make a single protein.Thus, it does not encompass the naturally existing coronaviruses surfaceantigen that is termed fusion (F) protein as described herein. Theunrelated amino acid sequences can be joined directly to each other orthey can be joined using a linker sequence. As used herein, proteins areunrelated, if their amino acid sequences are not normally found joinedtogether via a peptide bond in their natural environment (e.g., inside acell). For example, the amino acid sequences of bacterial enzymes suchas B. stearothermophilus dihydrolipoyl acyltransferase (E2p) and theamino acid sequences of a soluble coronavirus S glycoprotein are notnormally found joined together via a peptide bond.

Immunogen is a protein or a portion thereof that is capable of inducingan immune response in a mammal, such as a mammal infected or at risk ofinfection with a pathogen. Administration of an immunogen can lead toprotective immunity and/or proactive immunity against a pathogen ofinterest.

Immunogenic composition refers to a composition comprising animmunogenic polypeptide that induces a measurable CTL response againstvirus expressing the immunogenic polypeptide, or induces a measurable Bcell response (such as production of antibodies) against the immunogenicpolypeptide.

Sequence identity or similarity between two or more nucleic acidsequences, or two or more amino acid sequences, is expressed in terms ofthe identity or similarity between the sequences. Sequence identity canbe measured in terms of percentage identity; the higher the percentage,the more identical the sequences are. Two sequences are “substantiallyidentical” if two sequences have a specified percentage of amino acidresidues or nucleotides that are the same (i.e., 60% identity,optionally 65%, 70%, 75%, 80%, 85%, 90%, 95%, or 99% identity over aspecified region, or, when not specified, over the entire sequence),when compared and aligned for maximum correspondence over a comparisonwindow, or designated region as measured using one of the followingsequence comparison algorithms or by manual alignment and visualinspection. Optionally, the identity exists over a region that is atleast about 50 nucleotides (or 10 amino acids) in length, or morepreferably over a region that is 100 to 500 or 1000 or more nucleotides(or 20, 50, 200 or more amino acids) in length.

Homologs or orthologs of nucleic acid or amino acid sequences possess arelatively high degree of sequence identity/similarity when alignedusing standard methods. Methods of alignment of sequences for comparisonare well known in the art. Various programs and alignment algorithms aredescribed in: Smith & Waterman, Adv. Appl. Math. 2:482, 1981; Needleman& Wunsch, J. Mol. Biol. 48:443, 1970; Pearson & Lipman, Proc. Natl.Acad. Sci. USA 85:2444, 1988; Higgins & Sharp, Gene, 73:237-44, 1988;Higgins & Sharp, CABIOS 5:151-3, 1989; Corpet et al., Nuc. Acids Res.16:10881-90, 1988; Huang et al. Computer Appls. in the Biosciences 8,155-65, 1992; and Pearson et al., Meth. Mol. Bio. 24:307-31, 1994.Altschul et al., J. Mol. Biol. 215:403-10, 1990, presents a detailedconsideration of sequence alignment methods and homology calculations.

SpyCatcher-SpyTag refers to a protein ligation system that is based onbased on the internal isopeptide bond of the CnaB2 domain of FbaB, afibronectin-binding MSCRAMM and virulence factor of Streptococcuspyogenes. See, e.g., Terao et al., J. Biol. Chem. 2002; 277:47428-47435;and Zakeri et al., Proc. Natl. Acad. Sci. USA. 2012; 109:E690-E697. Itutilizes a modified domain from a Streptococcus pyogenes surface protein(SpyCatcher), which recognizes a cognate 13-amino-acid peptide (SpyTag).Upon recognition, the two form a covalent isopeptide bond between theside chains of a lysine in SpyCatcher and an aspartate in SpyTag. Thistechnology has been used, among other applications, to create covalentlystabilized multi-protein complexes, for modular vaccine production, andto label proteins (e.g., for microscopy). The SpyTag system is versatileas the tag is a short, unfolded peptide that can be genetically fused toexposed positions in target proteins; similarly, SpyCatcher can be fusedto reporter proteins such as GFP, and to epitope or purification tags.

The term “subject” refers to any animal classified as a mammal, e.g.,human and non-human mammals. Examples of non-human animals include dogs,cats, cattle, horses, sheep, pigs, goats, rabbits, and etc. Unlessotherwise noted, the terms “patient” or “subject” are used hereininterchangeably. Preferably, the subject is human.

The term “treating” or “alleviating” includes the administration ofcompounds or agents to a subject to prevent or delay the onset of thesymptoms, complications, or biochemical indicia of a disease (e.g., ACORONAVIRUS infection), alleviating the symptoms or arresting orinhibiting further development of the disease, condition, or disorder.Subjects in need of treatment include those already suffering from thedisease or disorder as well as those being at risk of developing thedisorder. Treatment may be prophylactic (to prevent or delay the onsetof the disease, or to prevent the manifestation of clinical orsubclinical symptoms thereof) or therapeutic suppression or alleviationof symptoms after the manifestation of the disease.

Vaccine refers to a pharmaceutical composition that elicits aprophylactic or therapeutic immune response in a subject. In some cases,the immune response is a protective immune response. Typically, avaccine elicits an antigen-specific immune response to an antigen of apathogen, for example a viral pathogen, or to a cellular constituentcorrelated with a pathological condition. A vaccine may include apolynucleotide (such as a nucleic acid encoding a disclosed antigen), apeptide or polypeptide (such as a disclosed antigen), a virus, a cell orone or more cellular constituents. In some embodiments of the invention,vaccines or vaccine immunogens or vaccine compositions are expressedfrom fusion constructs and self-assemble into nanoparticles displayingan immunogen polypeptide or protein on the surface.

Virus-like particle (VLP) refers to a non-replicating, viral shell,derived from any of several viruses. VLPs are generally composed of oneor more viral proteins, such as, but not limited to, those proteinsreferred to as capsid, coat, shell, surface and/or envelope proteins, orparticle-forming polypeptides derived from these proteins. VLPs can formspontaneously upon recombinant expression of the protein in anappropriate expression system. Methods for producing particular VLPs areknown in the art. The presence of VLPs following recombinant expressionof viral proteins can be detected using conventional techniques known inthe art, such as by electron microscopy, biophysical characterization,and the like. See, for example, Baker et al. (1991) Biophys. J.60:1445-1456; and Hagensee et al. (1994) J. Virol. 68:4503-4505. Forexample, VLPs can be isolated by density gradient centrifugation and/oridentified by characteristic density banding. Alternatively,cryoelectron microscopy can be performed on vitrified aqueous samples ofthe VLP preparation in question, and images recorded under appropriateexposure conditions.

A self-assembling nanoparticle refers to a ball-shape protein shell witha diameter of tens of nanometers and well-defined surface geometry thatis formed by identical copies of a non-viral protein capable ofautomatically assembling into a nanoparticle with a similar appearanceto VLPs. Known examples include ferritin (FR), which is conserved acrossspecies and forms a 24-mer, as well as B. stearothermophilusdihydrolipoyl acyltransferase (E2p), Aquifex aeolicus lumazine synthase(LS), and Thermotoga maritima encapsulin, which all form 60-mers.Self-assembling nanoparticles can form spontaneously upon recombinantexpression of the protein in an appropriate expression system. Methodsfor nanoparticle production, detection, and characterization can beconducted using the same techniques developed for VLPs.

III. Redesigned Coronavirus Soluble S Immunogens

The invention provides redesigned or modified soluble S sequences ofcoronaviruses that can be employed for generating vaccine compositions.The redesigned soluble S trimer immunogens or proteins are stabilized byintroducing modifications into the wildtype soluble S sequences ofcoronaviruses. Some specific wildtype soluble S sequences of specificSARS-CoV, MERS-CoV and SARS-CoV-2 strains or isolates are exemplifiedherein, e.g., SE ID NOs:1-3. Due to functional similarity and sequencehomology among different isolates or strains of a given coronavirus,redesigned soluble S immunogens derived from other known coronavirus Sprotein ortholog sequences can also be generated in accordance with theredesign strategy described herein. There are many known coronavirus Sprotein sequences that have been described in the literature. See, e.g.,James et al., J. Mol. Biol. 432:3309-25, 2020; Andersen et al., Nat.Med. 26:450-452, 2020; Walls et al., Cell 180:281-292, 2020; Zhang etal., J. Proteome Res. 19:1351-1360, 2020; Du et al., Expert Opin. Ther.Targets 21:131-143; 2017; Yang et al., Viral Immunol. 27:543-550, 2014;Wang et al., Antiviral Res. 133:165-177, 2016; Bosch et al., J. Virol.77:8801-8811, 2003; Lio et al., TRENDS Microbiol. 12:106-111, 2004;Chakraborti et al., Virol. J. 2:73, 2005; and Li, Ann. Rev. Virol.3:237-261, 2016.

As detailed herein, some redesigned soluble S immunogen polypeptides ofthe invention contain mutations that can enhance stability of theprefusion S structure. These include mutations that inactivate the S1/S2cleavage site, and mutations in HR1 that remove any strain in the turnregion between HR1 and CH, i.e., to prevent the formation of a straighthelix during fusion. In some embodiments, the resigned soluble Simmunogen polypeptides can additionally contain a truncation of the HR2motif. Truncation of the HR2 domain leads to disruption of the HR1/HR2fusion core and stabilizes the prefusion S structure.

Some engineered soluble S immunogen polypeptides are derived from aSARS-CoV-2 virus which caused COVID-19. Some of these polypeptidescontain a modified S1/S2 cleavage site. As exemplification, the wildtypesoluble S sequence to be used for engineering the SARS-CoV-2 immunogenpolypeptides of the invention is shown in SEQ ID NO:3 or N-terminalleader truncated soluble S sequence (SEQ ID NO:14). In otherembodiments, the wildtype S sequence to be used can be a variant of SEQID NO:3 or 14, e.g., a substantially identical or conservativelymodified variant thereof. Using amino acid numbering based on cryo-EMmodel PDB ID 6VSB or GenBank accession number MN908947.3 as reference,the modified cleavage site contains ⁶⁸²GSAGSV⁶⁸⁷ (SEQ ID NO:18).Inactivation of this cleavage site can be achieved by a number ofsequence alterations (e.g., deletions or substitutions) within or aroundthe site. One mutation that inactivates the cleavage site withoutotherwise impacting the structure of the protein is substitution ofresidues ⁶⁸²RRAR⁶⁸⁵ (SEQ ID NO:19) of the cleavage site with GSAG (SEQID NO:20), as exemplified herein. In addition to inactivation of thecleavage site, the soluble SARS-CoV-2 immunogen polypeptides canadditionally contain a double mutation in the HR1 region that removestrain in the turn region (between HR1 and CH motifs) during fusion bypreventing the formation of a straight helix. In various embodiments,this double mutation can be K986G/V987G, K986P/V987P, K986G/V987P orK986P/V987G.

Additional or alternative to the above-noted mutations that stabilizeprefusion S structure, some SARS-CoV-2 immunogen polypeptides of theinvention can contain a deletion of a substantial portion of or theentire HR2 domain. Using the exemplified soluble SARS-CoV-2 S sequenceSEQ ID NO:3 to illustrate, this deletion can encompass amino acidresidues 1150-1208 (SEQ ID NO:9). In various other embodiments, thedeletion can be a truncation of the first 35, 40, 45, 50, 55 or moreC-terminal residues of SEQ ID NO:3. In still some other embodiments, theC-terminal truncation of the wildtype soluble S sequence can extendbeyond the HR2 domain. In some of these embodiments, one or moreresidues in the region consisting residues 1139-1149 (SEQ ID NO:10) ofSEQ ID NO:3 can also be deleted. In some of these embodiments, theC-terminally truncated soluble S sequence can contain an insertedtripeptide motif, GNS, e.g., by substitution of residues 1139-1149 ofSEQ ID NO:3 with this motif. As described herein, this tripeptide motiffunctions to increase protein yield when the immunogen polypeptide isdisplayed on nanoparticles. In some other embodiments, the soluble Ssequence can include the N-terminal leader sequence shown in SEQ IDNO:15.

In some SARS-CoV-2 immunogen polypeptides of the invention, additionalmutations of the wildtype soluble S sequence can be introduced todestabilize the postfusion S structure. In some embodiments, one or moreproline and/or glycine substitution can be engineered in the region ofHR1 that interacts with HR2 to form the fusion core. These mutationsfunction to disrupt the six-helix-bundle fusion core. In variousembodiments, the mutations can include A942P, S943P, A944P, A942G, S943Gand A944G. In some embodiments, one or more extra amino acid residuescan be inserted into the region of HR1 that interacts with HR2 to formthe fusion core. Similarly, these insertions also function to disrupthelical pattern of the fusion core. In various embodiments, theinsertions can include insertion of G or GS between any residues inA942-A944.

As detailed in the Examples herein, several specific engineeredSARS-CoV-2 spike immunogen polypeptides have demonstrated enhancedimmunogenic properties relative to the wildtype SARS-CoV-2 spikeectodomain polypeptide or a well-known SARS-CoV-2 spike polypeptidecontaining a double-proline mutation (“S2P”). One of these exemplifiedSARS-CoV-2 spike polypeptides is S2GΔHR2 shown in SEQ ID NO:32. Relativeto the wildtype SARS-CoV-2 spike ectodomain sequence (SEQ ID NO:3),S2GΔHR2 contains substitution of the S1/S2 cleavage site sequence⁶⁸²RRARSV⁶⁸⁷ (SEQ ID NO:31) replaced with GSAGSV (SEQ ID NO:18). It alsocontains a K986G/V987G double mutation in HR1. Additionally, it has theHR2 region (E1150-Q1208) removed. As described herein, this engineeredSARS-CoV-2 spike immunogen polypeptide produced high-purity trimers,indicating a substantial reduction of spike metastability. It alsodisplayed higher affinity for representative mAbs specific for the spikein both ELISA and bio-layer interferometry (BLI) assays. When displayedon self-assembling nanoparticle scaffolds, this engineered proteinshowed satisfactory yield, purity, stability in production, andstructural integrity whereas the wild-type spike and the widely usedspike with a double proline mutation failed to express on any NPscaffold. The NP displayed S2GΔHR2 also showed improved antigenicitywhen tested against a panel of mAbs/Nabs. When examined in vivo, NPvaccines displaying this engineered spike also elicited neutralizingantibody responses that are up-to-10-folds stronger than the controlNPs.

Sequence of engineered SARS-CoV-2 spike protein “S2GΔHR2” (SEQ ID NO:32)is shown below. In the sequence, the N-terminal leader is italicized,the mutated S1/S2 cleavage site is underlined, and the substituted⁹⁸⁶GG⁹⁸⁷ residues are underlined and italicized.

MFVFLVLLPLVSS QCVNLTTRTQLPPAYTNSFTRGVYYPDKVFRSSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDCALDPLSETKCTLKSFTVEKGIYQTSNFRVQPTESIVRFPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPKKSTNLVKNKCVNFNFNGLTGTGVLTESNKKFLPFQQFGRDIADTTDAVRDPQTLEILDITPCSFGGVSVITPGTNTSNQVAVLYQDVNCTEVPVAIHADQLTPTWRVYSTGSNVFQTRAGCLIGAEHVNNSYECDIPIGAGICASYQTQTNSPGSAGSVASQSIIAYTMSLGAENSVAYSNNSIAIPTNFTISVTTEILPVSMTKTSVDCTMYICGDSTECSNLLLQYGSFCTQLNRALTGIAVEQDKNTQEVFAQVKQIYKTPPIKDFGGFNFSQILPDPSKPSKRSFIEDLLFNKVTLADAGFIKQYGDCLGDIAARDLICAQKFNGLTVLPPLLTDEMIAQYTSALLAGTITSGWTFGAGAALQIPFAMQMAYRFNGIGVTQNVLYENQKLIANQFNSAIGKIQDSLSSTASALGKLQDVVNQNAQALNTLVKQLSSNFGAISSVLNDILSRLD GG EAEVQIDRLITGRLQSLQTYVTQQLIRAAEIRASANLAATKMSECVLGQSKRVDFCGKGYHLMSFPQSAPHGVVFLHVTYVPAQEKNFTTAPAICHDGKAHFPREGVFVSNGTHWFVTQRNFYEPQIITTDNTFVSGNCDVVIGIVNNTVYDP LQPELDSFK

Some engineered soluble S immunogen polypeptides are derived from aSARS-CoV virus. Some of these polypeptides contain a modified S1/S2cleavage site. As exemplification, the wildtype soluble S sequence to beused for engineering the SARS-CoV immunogen polypeptides of theinvention is shown in SEQ ID NO:1 or N-terminal leader truncated solubleS sequence (SEQ ID NO:7). In other embodiments, the wildtype S sequenceto be used can be a variant of SEQ ID NO:1 or 7, e.g., a substantiallyidentical or conservatively modified variant thereof. Using amino acidnumbering based on UniProt ID P59594 or GenBank accession numberNP_828851 as reference, the modified sequence can contain a R667Gsubstitution, which leads to inactivation of the S1/S2 cleavage site. Inaddition to inactivation of the cleavage site, the soluble SARS-CoVimmunogen polypeptides can additionally a double mutation in the HR1region that remove strain in the turn region by preventing the formationof a straight helix during fusion. In various embodiments, this doublemutation can be K968G/V969G, K968P/V969P, K968G/V969P or K968P/V969G.

Additional or alternative to the above-noted mutations that stabilizeprefusion S structure, some SARS-CoV immunogen polypeptides of theinvention can contain a deletion of a substantial portion of or theentire HR2 domain. Using the exemplified soluble SARS-CoV S sequence SEQID NO:1 to illustrate, this deletion can encompass amino acid residues1132-1190 (SEQ ID NO:9). In various other embodiments, the deletion canbe a truncation of the first 35, 40, 45, 50, 55 or more C-terminalresidues of SEQ ID NO:1. In still some other embodiments, the C-terminaltruncation of the wildtype soluble S sequence can extend beyond the HR2domain. In some of these embodiments, one or more residues in the regionconsisting residues 1121-1131 (SEQ ID NO:10) of SEQ ID NO:1 can also bedeleted. In some of these embodiments, the C-terminally truncatedsoluble S sequence can contain an inserted tripeptide motif, GNS, e.g.,by substitution of residues 1121-1131 of SEQ ID NO:1 with this motif. Asdescribed herein, this tripeptide motif functions to increase proteinyield when the immunogen polypeptide is displayed on nanoparticles. Insome other embodiments, the soluble S sequence can have the N-terminalleader sequence truncated.

In some SARS-CoV immunogen polypeptides of the invention, additionalmutations of the wildtype soluble S sequence can be introduced todestabilize the postfusion S structure. In some embodiments, one or moreproline and/or glycine substitution can be engineered in the region ofHR1 that interacts with HR2 to form the fusion core. These mutationsfunction to disrupt the six-helix-bundle fusion core. In variousembodiments, the mutations can include S924P, T925P, A926P, S924G,T925G, and A926G. In some embodiments, one or more extra amino acidresidues can be inserted into the region of HR1 that interacts with HR2to form the fusion core. Similarly, these insertions also function todisrupt helical pattern of the fusion core. In various embodiments, theinsertions can include insertion of G or GS between any residues inA924-A926.

Some engineered soluble S immunogen polypeptides are derived from aMERS-CoV virus. Some of these polypeptides contain a modified S1/S2cleavage site. As exemplification, the wildtype soluble S sequence to beused for engineering the MERS-CoV immunogen polypeptides of theinvention is shown in SEQ ID NO:2 or N-terminal leader truncated solubleS sequence (SEQ ID NO:11). In other embodiments, the wildtype S sequenceto be used can be a variant of SEQ ID NO:2 or 11, e.g., a substantiallyidentical or conservatively modified variant thereof. Using amino acidnumbering based on UniProt ID R9UQ53 or GenBank accession numberJX869059.2 as reference, the modified sequence can contain a R748A/R751Gdouble mutation, which leads to inactivation of the S1/S2 cleavage site.In addition to inactivation of the cleavage site, the soluble MERS-CoVimmunogen polypeptides can additionally a double mutation in the HR1region that remove strain in the turn region by preventing the formationof a straight helix during fusion. In various embodiments, this doublemutation can be V1060G/L1061G, V1060P/L1061P, V1060G/L1061P orV1060P/L1061G.

Additional or alternative to the above-noted mutations that stabilizeprefusion S structure, some MERS-CoV immunogen polypeptides of theinvention can contain a deletion of a substantial portion of or theentire HR2 domain. Using the exemplified soluble MERS-CoV S sequence SEQID NO:2 to illustrate, this deletion can encompass amino acid residues1229-1291 (SEQ ID NO:13). In various other embodiments, the deletion canbe a truncation of the first 35, 40, 45, 50, 55, 60 or more C-terminalresidues of SEQ ID NO:2. In some other embodiments, the soluble Ssequence can have the N-terminal leader sequence truncated.

In some MERS-CoV immunogen polypeptides of the invention, additionalmutations of the wildtype soluble S sequence can be introduced todestabilize the postfusion S structure. In some embodiments, one or moreproline and/or glycine substitution can be engineered in the region ofHR1 that interacts with HR2 to form the fusion core. These mutationsfunction to disrupt the six-helix-bundle fusion core. In variousembodiments, the mutations can include T1013P, T1014P, T1015P, T1013G,T1014G and T1015G. In some embodiments, one or more extra amino acidresidues can be inserted into the region of HR1 that interacts with HR2to form the fusion core. Similarly, these insertions also function todisrupt helical pattern of the fusion core. In various embodiments, theinsertions can include insertion of G or GS between any residues inT1013-T1015.

In addition to the various substitutions and deletions noted above, theengineered coronavirus soluble S immunogen polypeptides of the inventioncan further contain a trimerization motif at the C-terminus. Suitabletrimerization motifs for the invention include, e.g., T4 fibritin foldon(PDB ID: 4NCV) and viral capsid protein SHP (PDB: 1TD0). T4 fibritin(foldon) is well known in the art, and constitutes the C-terminal 30amino acid residues of the trimeric protein fibritin from bacteriophageT4, and functions in promoting folding and trimerization of fibritin.See, e.g., Papanikolopoulou et al., J. Biol. Chem. 279: 8991-8998, 2004;and Guthe et al., J. Mol. Biol. 337: 905-915, 2004. Similarly, the SHPprotein and its used as a functional trimerization motis are also wellknown in the art. See, e.g., Dreier et al., Proc Natl Acad Sci USA 110:E869-E877, 2013; and Hanzelmann et al., Structure 24: 140-147, 2016. Thespecific foldon and SHP sequences exemplified herein areGYIPEAPRDGQAYVRKDGEWVLLSTFL (foldon; SEQ ID NO:26), andEVRIFAGNDPAHTATGSSGISSPTPALTPLMLDEATGKLVVWDGQKAGSAVGILVLPLEGTETALTYYKSGTFATEAIHWPESVDEHKKANAFAGSALSHAA (1TD0; SEQ ID NO:27).In some embodiments, the trimerization motif is linked to the redesignedsoluble S immunogen polypeptide via a short GS linker. The inclusion ofthe linker is intended to stabilize the formed trimer molecule. Invarious embodiments, the linker can contain 1-6 tandem repeats of GS. Insome embodiments, an His6-tag can be added to the C-terminus of thetrimerization motif to facilitate protein purification, e.g., by using aNickel column.

In addition to S2GΔHR2 described above, other exemplary engineeredSARS-CoV-2 spike proteins of the invention are shown in SEQ IDNOs:33-37. SEQ ID NO:33 is the sequence of S2GΔHR2 minus its N-terminalleader. Fusions of this sequence to trimerization motif foldon (SEQ IDNO:26) and 1TD0 (SEQ ID NO:27) are shown in SEQ ID NOs:35 and 36,respectively. In each of these two fusion sequences, a restriction siteAS is introduced at the C-terminus of the engineered spike protein,which is then connected to the N-terminus of the trimerization motif viaa G₄S linker. SEQ ID NO:34 is a variant of SEQ ID NO:33 containing a HR1swap. Specifically, the HR1 region L922-5943 is replaced by theequivalent region from SARS-CoV-1 spike protein. As exemplified herein,fusions containing this HR1 swapped SARS-CoV-2 spike protein to atrimerization motif (e.g., 1TD0) also displayed satisfactory immunogenicproperties only when the HR2 stalk was removed. One such fusion is shownin SEQ ID NO:37. Any of these exemplified sequences, substantiallyidentical sequences or conservatively modified variants thereof can beused in the invention for developing SARS-CoV-2 vaccines, e.g.,nanoparticle scaffolded vaccines.

Sequence of engineered SARS-CoV-2 spike: S2GΔHR2 (minus N-terminalleader) (SEQ ID NO:33):

QCVNLTTRTQLPPAYTNSFTRGVYYPDKVFRSSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDCALDPLSETKCTLKSFTVEKGIYQTSNFRVQPTESIVRFPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPKKSTNLVKNKCVNFNFNGLTGTGVLTESNKKFLPFQQFGRDIADTTDAVRDPQTLEILDITPCSFGGVSVITPGTNTSNQVAVLYQDVNCTEVPVAIHADQLTPTWRVYSTGSNVFQTRAGCLIGAEHVNNSYECDIPIGAGICASYQTQTNSPGSAGSVASQSIIAYTMSLGAENSVAYSNNSIAIPTNFTISVTTEILPVSMTKTSVDCTMYICGDSTECSNLLLQYGSFCTQLNRALTGIAVEQDKNTQEVFAQVKQIYKTPPIKDFGGFNFSQILPDPSKPSKRSFIEDLLFNKVTLADAGFIKQYGDCLGDIAARDLICAQKFNGLTVLPPLLTDEMIAQYTSALLAGTITSGWTFGAGAALQIPFAMQMAYRFNGIGVTQNVLYENQKLIANQFNSAIGKIQDSLSSTASALGKLQDVVNQNAQALNTLVKQLSSNFGAISSVLNDILSRLDGGEAEVQIDRLITGRLQSLQTYVTQQLIRAAEIRASANLAATKMSECVLGQSKRVDFCGKGYHLMSFPQSAPHGVVFLHVTYVPAQEKNFTTAPAICHDGKAHFPREGVFVSNGTHWFVTQRNFYEPQIITTDNTFVSGNCDVVIGIVNNTVYDP LQPELDSFK

Sequence of S2GΔHR2-foldon fusion (SEQ ID NO:35). In the sequence, theintroduced restriction site AS is italicized and underlined, the G₄Slinker is italicized, and the foldon sequence is underlined.

QCVNLTTRTQLPPAYTNSFTRGVYYPDKVFRSSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDCALDPLSETKCTLKSFTVEKGIYQTSNFRVQPTESIVRFPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPKKSTNLVKNKCVNFNFNGLTGTGVLTESNKKFLPFQQFGRDIADTTDAVRDPQTLEILDITPCSFGGVSVITPGTNTSNQVAVLYQDVNCTEVPVAIHADQLTPTWRVYSTGSNVFQTRAGCLIGAEHVNNSYECDIPIGAGICASYQTQTNSPGSAGSVASQSIIAYTMSLGAENSVAYSNNSIAIPTNFTISVTTEILPVSMTKTSVDCTMYICGDSTECSNLLLQYGSFCTQLNRALTGIAVEQDKNTQEVFAQVKQIYKTPPIKDFGGFNFSQILPDPSKPSKRSFIEDLLFNKVTLADAGFIKQYGDCLGDIAARDLICAQKFNGLTVLPPLLTDEMIAQYTSALLAGTITSGWTFGAGAALQIPFAMQMAYRFNGIGVTQNVLYENQKLIANQFNSAIGKIQDSLSSTASALGKLQDVVNQNAQALNTLVKQLSSNFGAISSVLNDILSRLDGGEAEVQIDRLITGRLQSLQTYVTQQLIRAAEIRASANLAATKMSECVLGQSKRVDFCGKGYHLMSFPQSAPHGVVFLHVTYVPAQEKNFTTAPAICHDGKAHFPREGVFVSNGTHWFVTQRNFYEPQIITTDNTFVSGNCDVVIGIVNNTVYDP LQPELDSFK AS GGGGSGYIPEAPRDGQAYVRKDGEWVLLSTFL

Sequence of S2GΔHR2-1TD0 fusion (SEQ ID NO:36). In the sequence, theintroduced restriction site AS is italicized and underlined, the G₄Slinker is italicized, and the 1TD0 sequence is underlined.

QCVNLTTRTQLPPAYTNSFTRGVYYPDKVFRSSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDCALDPLSETKCTLKSFTVEKGIYQTSNFRVQPTESIVRFPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPKKSTNLVKNKCVNFNFNGLTGTGVLTESNKKFLPFQQFGRDIADTTDAVRDPQTLEILDITPCSFGGVSVITPGTNTSNQVAVLYQDVNCTEVPVAIHADQLTPTWRVYSTGSNVFQTRAGCLIGAEHVNNSYECDIPIGAGICASYQTQTNSPGSAGSVASQSIIAYTMSLGAENSVAYSNNSIAIPTNFTISVTTEILPVSMTKTSVDCTMYICGDSTECSNLLLQYGSFCTQLNRALTGIAVEQDKNTQEVFAQVKQIYKTPPIKDFGGFNFSQILPDPSKPSKRSFIEDLLFNKVTLADAGFIKQYGDCLGDIAARDLICAQKFNGLTVLPPLLTDEMIAQYTSALLAGTITSGWTFGAGAALQIPFAMQMAYRFNGIGVTQNVLYENQKLIANQFNSAIGKIQDSLSSTASALGKLQDVVNQNAQALNTLVKQLSSNFGAISSVLNDILSRLDGGEAEVQIDRLITGRLQSLQTYVTQQLIRAAEIRASANLAATKMSECVLGQSKRVDFCGKGYHLMSFPQSAPHGVVFLHVTYVPAQEKNFTTAPAICHDGKAHFPREGVFVSNGTHWFVTQRNFYEPQIITTDNTFVSGNCDVVIGIVNNTVYDP LQPELDSFK AS GGGGSEVRIFAGNDPAHTATGSSGISSPTPALTPLMLDEATGKLVVWDGQKAGSAVGILVLPLEGTETALTYYKSGTFATEAIHWPE SVDEHKKANAFAGSALSHAA

Sequence of HR1 swapped S2GΔHR2 (SEQ ID NO:34): substituting HR1 regionfrom SARs-CoV-1 is underlined.

QCVNLTTRTQLPPAYTNSFTRGVYYPDKVFRSSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDCALDPLSETKCTLKSFTVEKGIYQTSNFRVQPTESIVRFPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPKKSTNLVKNKCVNFNFNGLTGTGVLTESNKKFLPFQQFGRDIADTTDAVRDPQTLEILDITPCSFGGVSVITPGTNTSNQVAVLYQDVNCTEVPVAIHADQLTPTWRVYSTGSNVFQTRAGCLIGAEHVNNSYECDIPIGAGICASYQTQTNSPGSAGSVASQSIIAYTMSLGAENSVAYSNNSIAIPTNFTISVTTEILPVSMTKTSVDCTMYICGDSTECSNLLLQYGSFCTQLNRALTGIAVEQDKNTQEVFAQVKQIYKTPPIKDFGGFNFSQILPDPSKPSKRSFIEDLLFNKVTLADAGFIKQYGDCLGDIAARDLICAQKFNGLTVLPPLLTDEMIAQYTSALLAGTITSGWTFGAGAALQIPFAMQMAYRFNGIGVTQNVLYENQKQIANQFNKAISQIQESLTTTSTALGKLQDVVNQNAQALNTLVKQLSSNFGAISSVLNDILSRLDGGEAEVQIDRLITGRLQSLQTYVTQQLIRAAEIRASANLAATKMSECVLGQSKRVDFCGKGYHLMSFPQSAPHGVVFLHVTYVPAQEKNFTTAPAICHDGKAHFPREGVFVSNGTHWFVTQRNFYEPQIITTDNTFVSGNCDVVIGIVNNTVYDP LQPELDSFK

Sequence of fusion of HR1 swapped S2GΔHR2 to 1TD0 (SEQ ID NO:37). In thesequence, the introduced restriction site AS is italicized andunderlined, the G₄S linker is italicized, and the 1TD0 sequence isunderlined.

QCVNLTTRTQLPPAYTNSFTRGVYYPDKVFRSSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDCALDPLSETKCTLKSFTVEKGIYQTSNFRVQPTESIVRFPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPKKSTNLVKNKCVNFNFNGLTGTGVLTESNKKFLPFQQFGRDIADTTDAVRDPQTLEILDITPCSFGGVSVITPGTNTSNQVAVLYQDVNCTEVPVAIHADQLTPTWRVYSTGSNVFQTRAGCLIGAEHVNNSYECDIPIGAGICASYQTQTNSPGSAGSVASQSIIAYTMSLGAENSVAYSNNSIAIPTNFTISVTTEILPVSMTKTSVDCTMYICGDSTECSNLLLQYGSFCTQLNRALTGIAVEQDKNTQEVFAQVKQIYKTPPIKDFGGFNFSQILPDPSKPSKRSFIEDLLFNKVTLADAGFIKQYGDCLGDIAARDLICAQKFNGLTVLPPLLTDEMIAQYTSALLAGTITSGWTFGAGAALQIPFAMQMAYRFNGIGVTQNVLYENQKQIANQFNKAISQIQESLTTTSTALGKLQDVVNQNAQALNTLVKQLSSNFGAISSVLNDILSRLDGGEAEVQIDRLITGRLQSLQTYVTQQLIRAAEIRASANLAATKMSECVLGQSKRVDFCGKGYHLMSFPQSAPHGVVFLHVTYVPAQEKNFTTAPAICHDGKAHFPREGVFVSNGTHWFVTQRNFYEPQIITTDNTFVSGNCDVVIGIVNNTVYDP LQPELDSFK AS GGGGSEVRIFAGNDPAHTATGSSGISSPTPALTPLMLDEATGKLVVWDGQKAGSAVGILVLPLEGTETALTYYKSGTFATEAIHWPE SVDEHKKANAFAGSALSHAA

IV. Nanoparticle Displayed Coronavirus Vaccine Compositions

The invention provides vaccine compositions that contain a heterologousscaffold that display at least one immunogen polypeptide or trimerprotein derived from coronavirus S proteins. In some embodiments, theemployed coronavirus S immunogen is a stabilized soluble S polypeptidecontaining various stabilizing mutations described above. In some otherembodiments, the employed coronavirus immunogen contains or is derivedfrom the RBD domain of coronavirus S proteins. In the latterembodiments, a SpyTag/SpyCatcher ligation system is used. As detailed inthe Examples herein, the RBD sequence can be fused to a SpyTag motif,and the nanoparticle subunit sequence can be fused to a SpyCatchermotif. Alternatively, the RBD sequence can be fused to a SpyCatchermotif, and the nanoparticle subunit sequence can be fused to a SpyTagmotif. In exemplified embodiments, the employed RBD sequence can containthe sequence shown in any one of SEQ ID NOs:4-6, or a substantiallyidentical or conservatively modified variant there. Upon introducing thetwo constructs expressing the SpyTag fusion and the SpyCatcher fusioninto host or producer cells, nanoparticle vaccines displaying an arrayof RBD proteins on the surface will be generated as a result ofSpyTag/SpyCatcher mediated ligation of RBD proteins to theself-assembled nanoparticles.

Any heterologous scaffold can be used to present the immunogen proteinor polypeptide in the construction of the vaccines of the invention.This includes a virus-like particle (VLP) such as bacteriophage Q_(β)VLP and nanoparticles. Various nanoparticle platforms can be employed ingenerating the vaccine compositions of the invention. In general, thenanoparticles employed in the invention need to be formed by multiplecopies of a single subunit. The nanoparticles are typically ball-likeshaped, and/or have rotational symmetry (e.g., with 3-fold and 5-foldaxis), e.g., with an icosahedral structure exemplified herein.Additionally or alternatively, the amino-terminus of the particlesubunit has to be exposed and in close proximity to the 3-fold axis, andthe spacing of three amino-termini has to closely match the spacing ofthe carboxyol-termini of the displayed trimeric stabilized soluble Sprotein.

In various embodiments, the employed self-assembling nanoparticles havea diameter of about 25 nm or less (usually assembled from 12, 24, or 60subunits) and 3-fold axes on the particle surface. Such nanoparticlesprovide suitable particle platforms to produce multivalent vaccines. Insome preferred embodiments, the coronavirus immunogen protein orpolypeptide can be presented on self-assembling nanoparticles such asself-assembling nanoparticles derived from ferritin (FR) or E2p asexemplified herein. Other examples of nanoparticles suitable for theinvention include nanoparticles derived from I3-01. Well known androutinely used in the art, ferritin is a globular protein found in allanimals, bacteria, and plants. As is well known in the art, it actsprimarily to control the rate and location of polynuclear Fe(III)₂O₃formation through the transportation of hydrated iron ions and protonsto and from a mineralized core. The globular form of ferritin is made upof monomeric subunit proteins (also referred to as monomeric ferritinsubunits), which are polypeptides having a molecule weight ofapproximately 17-20 kDa. E2p is a redesigned variant of dihydrolipoylacyltransferase from Bacillus stearothermophilus that has been shown toself-assemble into thermostable 60-meric nanoparticle. See, e.g., He etal., Nat. Commun. 7:12041, 2016. Similarly, I3-01 is an engineeredprotein that can self-assemble into hyperstable nanoparticles. See,e.g., Hsia et al., Nature 535, 136-139, 2016. Sequences of the subunitsof these proteins are known in the art. See, e.g., WO2017/192434. Moredetailed information on the structural and functional properties of thevarious nanoparticle scaffolds, as well as their use in presentingtrimeric protein immunogens, is provided in the art. See, e.g.,WO2017/192434, WO2019/089817 and WO2019/241483. In various embodiments,the coronavirus vaccine compositions of the invention can employ any ofthese known nanoparticles, as well as their conservatively modifiedvariants or variants with substantially identical (e.g., at least 90%,95% or 99% identical) sequences.

In addition to the nanoparticle sequences noted above, many othernanoparticles or VLPs known in the art may also be used in the practiceof the invention. These include, e.g., Aquifex aeolicus lumazinesynthase, Thermotoga maritima encapsulin, Myxococcus xanthus encapsulin,bacteriophage Qbeta virus particle, Flock House Virus (FHV) particle,ORSAY virus particle, and infectious bursal disease virus (IBDV)particle.

In some exemplary embodiments, the nanoparticle vaccines of theinvention contain a nanoparticle subunit sequence as shown in SEQ IDNO:23 (I3-01v9), SEQ ID NO:24 (E2p), or SEQ ID NO:25 (ferritin), aconservatively modified variant or a substantially identical sequencethereof. Typically, C-terminus of the engineered coronavirus immunogenpolypeptide is fused to the N-terminus of subunit of the self-assemblingnanoparticle (NP). In some embodiments, C-terminus of the engineeredcoronavirus polypeptide is fused to the nanoparticle subunit sequence ofthe self-assembling nanoparticle via a GS linker sequence, e.g., G₄S(GGGGS, SEQ ID NO:21) or (G₄S)₂ (GGGGSGGGGS; SEQ ID NO:22).

I3-01v9 subunit sequence (SEQ ID NO:23)

MKMEELFKKHKIVAVLRANSVEEAKMKALAVFVGGVHLIEITFTVPDADTVIKELSFLKELGAIIGAGTVTSVEQCRKAVESGAEFIVSPHLDEEISQFCKEKGVFYMPGVMTPTELVKAMKLGHTILKLFPGEVVGPQFVKAMKGPFPNVKFVPTGGVNLDNVCEWFKAGVLAVGVGSALVKG TIAEVAAKAAAFVEKIRGCTE

E2p subunit sequence (SEQ ID NO:24)

AAAKPATTEGEFPETREKMSGIRRAIAKAMVHSKHTAPHVTLMDEADVTKLVAHRKKFKAIAAEKGIKLTFLPYVVKALVSALREYPVLNTAIDDETEEIIQKHYYNIGIAADTDRGLLVPVIKHADRKPIFALAQEINELAEKARDGKLTPGEMKGASCTITNIGSAGGQWFTPVINHPEVAILGIGRIAEKPIVRDGEIVAAPMLALSLSFDHRMIDGATAQKALNHI KRLLSDPELLLM

Ferritin sequence (SEQ ID NO:25)

DIIKLLNEQVNKEMNSSNLYMSMSSWCYTHSLDGAGLFLFDHAAEEYEHAKKLIIFLNENNVPVQLTSISAPEHKFEGLTQIFQKAYEHEQHISESINNIVDHAIKSKDHATFNFLQWYVAEQHEEEVLFKDILDKIELIGNENHGL YLADQYVKGIAKSRK

Other than the displayed soluble S immunogen, the nanoparticle vaccinecompositions of the invention can include additional motifs for betterbiological or pharmaceutical properties. These additional structuralcomponents can function to facilitate the immunogen display on thesurface of the nanoparticles, to enhance the stability of the displayedimmunogens, and/or to improve yield and purity of the self-assembledprotein vaccines. In these embodiments, one or more linkers (linkersequences, motifs or moieties) can be used to connect the variousstructural components in the constructs.

In some embodiments, the nanoparticle vaccines of the invention cancontain the coding sequence of a protein domain that serves to stabilizethe immunogen polypeptide, such as the trimerization motif of T4fibritin (“foldon”) as noted above, or to elevate the immunogenpolypeptide from the nanoparticle surface, such as a three-helix bundle(“neck domain”), or to facilitate immunoaffinity purification, such as aprotein domain with known binding antibodies. These sequences can beadded between the immunogen polypeptide sequence and the nanoparticlesubunit sequence.

In some of these embodiments, a trimerization motif such as foldon andviral capsid protein SHP (PDB: 1TD0) can be added to the C-terminus ofthe stabilized soluble S protein as exemplified herein. As describedabove, the trimerization motif can be inserted with a short GS linker tofurther stabilize the trimer and also to increase the trimer ratiowithin the total protein yield. In some embodiments, the coding sequenceof a polypeptide fragment or motif that serves as an active site forchemical conjugation can be inserted into the construct at anappropriate position. In some embodiments, additional structuralcomponents such as a CD4⁺ T-helper epitope or a CD8⁺ T-cell epitope canalso be inserted into the nanoparticle construct at an appropriateposition. These include, e.g., the PADRE T-helper epitope(AKFVAAWTLKAAA; SEQ ID NO:30) as exemplified herein. In some exemplaryembodiments, the T-helper epitope can be inserted to the C-terminus of alocking domain, which is in turn fused to the C-terminus of the NPsubunit sequence described below.

In some embodiments, the nanoparticle vaccines of the invention cancontain a locking domain that stabilizes the nanoparticle. The lockingdomain coding sequence can be fused directly or indirectly to theC-terminus of the nanoparticle subunit coding sequence. The lockingdomain stabilizes the nanoparticles from the inside so that thenanoparticles presenting the coronavirus immunogen polypeptide canremain intact during manufacture, vaccine formulation, and immunization.The nanoparticle vaccine immunogens thus constructed have significantlyenhanced stability. In general, the locking domain suitable for theinvention is a protein subunit that can naturally form a dimer withanother protein subunit in solution through non-covalent interactions atthe interface. In some preferred embodiments, the two protein subunitscan be identical in sequence and form a homodimer. In some otherembodiments, the two protein subunits can be different proteins, or twodifferent domains of a single protein derived through engineering, thatcan form a heterodimer in solution through non-covalent interactions atthe interface. Typically, the locking domain is covalently fused to thenanoparticle subunit to which the immunogen polypeptide is linked.Examples of specific locking domains and guidance on the use of alocking domain in the construction of nanoparticle displayed trimericimmunogens can be found in the art, e.g., WO2019/241483. In someexemplary embodiments, the employed LD contains the sequence shown inSEQ ID NO:28 (LD4) or 29 (LD7), a conservatively modified variant or asubstantially identical sequence thereof.

Locking domain LD4 (SEQ ID NO:28):

FSEEQKKALDLAFYFDRRLTPEWRRYLSQRLGLNEEQIERWFRRKEQQI GWSHPQFEK

Locking domain LD7 (SEQ ID NO:29):

SPAVDIGDRLDELEKALEALSAEDGHDDVGQRLESLLRRWNSRRAD

Nanoparticles displaying any of the stabilized coronavirus soluble Sprotein immunogens described herein (e.g., stabilized SARS-CoV-2 solubleS trimer immunogens) can be constructed by fusing the immunogenpolypeptide or subunit of multimeric immunogen protein (e.g., a trimerimmunogen) to the subunit of the nanoparticle (e.g., E2p or I3-01subunit), as well as the other optional or alternative componentsdescribed herein (e.g., a locking domain or a trimerization motif). Toconstruct the nanoparticle displayed fusion vaccine immunogens of theinvention, one or more linker motifs or moieties may be employed tofacilitate connection and maintain structural integrity of the differentcomponents. Typically, the linker motifs contain short peptidesequences, e.g., GS-rich peptides. In various embodiments, the linkersor linker motifs can be any flexible peptides that connect two proteindomains or motifs without interfering with their functions. For example,the employed linker can be a 5-aa G₄S linker (SEQ ID NO:21) or a 10-aa(G₄S)₂ linker (SEQ ID NO:22) as exemplified herein to connect (1) aspike protein and a nanoparticle scaffold sequence, (2) a spike proteinand a trimerization motif, and/or (3) a nanoparticle scaffold sequenceand a locking domain sequence. In some embodiments, a dipeptide GSlinker can be used to connect a locking domain to a T epitope asexemplified herein. Detailed procedures for recombinant production ofthe vaccine compositions of the invention can be based on the protocolsdescribed herein and/or other methods that have been described in theart, e.g., He et al., Nat. Comm. 7, 12041, 2016; Kong et al., Nat. Comm.7, 12040, 2016; He et al., Sci Adv. 4(11):eaau6769, 2018; He et al.,bioRxiv, 2020.2008.2022.262634, 2020; WO2017/192434; WO2019/089817 andWO2019/241483.

Sequences of several specific nanoparticle displayed SARS-CoV-2 spikeproteins of the invention are exemplified in SEQ ID NOs:38-40. SEQ IDNO:38 is the fusion sequence containing the leader-less S2GΔHR2 (SEQ IDNO:33) that is connected to nanoparticle sequence I3-01v9 (SEQ ID NO:23)via a (G₄S)₂ linker. This nanoparticle displayed spike further containsat its C-terminus the locking domain LD7 (SEQ ID NO:29) and the PADRET-epitope (SEQ ID NO:30). SEQ ID NO:39 is the fusion sequence containingthe leader-less S2GΔHR2 (SEQ ID NO:33) that is connected to nanoparticlesequence E2p (SEQ ID NO:24) via a G₄S linker. This nanoparticledisplayed spike further contains at its C-terminus the locking domainLD4 (SEQ ID NO:28) and the PADRE T-epitope (SEQ ID NO:30). SEQ ID NO:40is the fusion sequence containing the leader-less S2GΔHR2 (SEQ ID NO:33)that is connected to nanoparticle sequence ferritin (SEQ ID NO:25) via aG₄S linker. In addition to these specifically exemplified fusionconstructs, the invention also encompasses SARS-CoV-2 nanoparticlevaccines that contain a subunit sequence that is a substantiallyidentical to or conservatively modified variant of any of theseexemplified nanoparticle vaccine sequences.

Sequence of 3 exemplary SARS-CoV-2 nanoparticle vaccines are shown inSEQ ID NOs:38-40 below. In these sequences, GS linkers (1) between thespike protein and the nanoparticle subunit sequence, (2) between thenanoparticle subunit sequence and the locking domain and (3) between thelocking domain and the T-epitope are bolded, the nanoparticle subunitsequence is underlined, introduced restriction site AS is italicized andunderlined, the locking domain sequence is italicized, and the T-epitopesequence is underlined and bolded.

Sequence of S2GΔHR2-10GS-I3-01v9-LD7-PADRE (SEQ ID NO:38).

QCVNLTTRTQLPPAYTNSFTRGVYYPDKVFRSSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDCALDPLSETKCTLKSFTVEKGIYQTSNFRVQPTESIVRFPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPKKSTNLVKNKCVNFNFNGLTGTGVLTESNKKFLPFQQFGRDIADTTDAVRDPQTLEILDITPCSFGGVSVITPGTNTSNQVAVLYQDVNCTEVPVAIHADQLTPTWRVYSTGSNVFQTRAGCLIGAEHVNNSYECDIPIGAGICASYQTQTNSPGSAGSVASQSHAYTMSLGAENSVAYSNNSIAIPTNFTISVTTEILPVSMTKTSVDCTMYICGDSTECSNLLLQYGSFCTQLNRALTGIAVEQDKNTQEVFAQVKQIYKTPPIKDFGGFNFSQILPDPSKPSKRSFIEDLLFNKVTLADAGFIKQYGDCLGDIAARDLICAQKFNGLTVLPPLLTDEMIAQYTSALLAGTITSGWTFGAGAALQIPFAMQMAYRFNGIGVTQNVLYENQKLIANQFNSAIGKIQDSLSSTASALGKLQDVVNQNAQALNTLVKQLSSNFGAISSVLNDILSRLDGGEAEVQIDRLITGRLQSLQTYVTQQLIRAAEIRASANLAATKMSECVLGQSKRVDFCGKGYHLMSFPQSAPHGVVFLHVTYVPAQEKNFTTAPAICHDGKAHFPREGVFVSNGTHWFVTQRNFYEPQIITTDNTFVSGNCDVVIGIVNNTVYDPL QPELDSFK AS GGGGSGGGGSMKMEELFKKHKIVAVLRANSVEEAKMKALAVFVGGVHLIEITFTVPDADTVIKELSFLKELGAIIGAGTVTSVEQCRKAVESGAEFIVSPHLDEEISQFCKEKGVFYMPGVMTPTELVKAMKLGHTILKLFPGEVVGPQFVKAMKGPFPNVKFVPTGGVNLDNVCEWFKAGVLAVGVGSALVKGTIAEVAAKAAAFVEKIRGCTE GGGGS SPAVDIGDRLDELEKALEALSAEDGHDDVGQRLESLLRRWNSRRADGS AKFVAAWTLKAAA

Sequence of nanoparticle vaccine S2GΔHR2-5GS-E2p-LD4-PADRE (SEQ IDNO:39):

QCVNLTTRTQLPPAYTNSFTRGVYYPDKVFRSSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDCALDPLSETKCTLKSFTVEKGIYQTSNFRVQPTESIVRFPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPKKSTNLVKNKCVNFNFNGLTGTGVLTESNKKFLPFQQFGRDIADTTDAVRDPQTLEILDITPCSFGGVSVITPGTNTSNQVAVLYQDVNCTEVPVAIHADQLTPTWRVYSTGSNVFQTRAGCLIGAEHVNNSYECDIPIGAGICASYQTQTNSPGSAGSVASQSHAYTMSLGAENSVAYSNNSIAIPTNFTISVTTEILPVSMTKTSVDCTMYICGDSTECSNLLLQYGSFCTQLNRALTGIAVEQDKNTQEVFAQVKQIYKTPPIKDFGGFNFSQILPDPSKPSKRSFIEDLLFNKVTLADAGFIKQYGDCLGDIAARDLICAQKFNGLTVLPPLLTDEMIAQYTSALLAGTITSGWTFGAGAALQIPFAMQMAYRFNGIGVTQNVLYENQKLIANQFNSAIGKIQDSLSSTASALGKLQDVVNQNAQALNTLVKQLSSNFGAISSVLNDILSRLDGGEAEVQIDRLITGRLQSLQTYVTQQLIRAAEIRASANLAATKMSECVLGQSKRVDFCGKGYHLMSFPQSAPHGVVFLHVTYVPAQEKNFTTAPAICHDGKAHFPREGVFVSNGTHWFVTQRNFYEPQIITTDNTFVSGNCDVVIGIVNNTVYDPL QPELDSFK AS GGGGSAAAKPATTEGEFPETREKMSGIRRAIAKAMVHSKHTAPHVTLMDEADVTKLVAHRKKFKAIAAEKGIKLTFLPYVVKALVSALREYPVLNTAIDDETEEIIQKHYYNIGIAADTDRGLLVPVIKHADRKPIFALAQEINELAEKARDGKLTPGEMKGASCTITNIGSAGGQWFTPVINHPEVAILGIGRIAEKPIVRDGEIVAAPMLALSLSFDHRMIDGATAQKALNHI KRLLSDPELLLMGGG GSFSEEQKKALDLAFYFDRRLTPEWRRYLSQRLG LNEEQIERWFRRKEQQIGWSHPQFEK GSAKFVAAWTLKAAA

Sequence of nanoparticle vaccine S2GΔHR2-5GS-ferritin (SEQ ID NO:40):

QCVNLTTRTQLPPAYTNSFTRGVYYPDKVFRSSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDCALDPLSETKCTLKSFTVEKGIYQTSNFRVQPTESIVRFPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPKKSTNLVKNKCVNFNFNGLTGTGVLTESNKKFLPFQQFGRDIADTTDAVRDPQTLEILDITPCSFGGVSVITPGTNTSNQVAVLYQDVNCTEVPVAIHADQLTPTWRVYSTGSNVFQTRAGCLIGAEHVNNSYECDIPIGAGICASYQTQTNSPGSAGSVASQSHAYTMSLGAENSVAYSNNSIAIPTNFTISVTTEILPVSMTKTSVDCTMYICGDSTECSNLLLQYGSFCTQLNRALTGIAVEQDKNTQEVFAQVKQIYKTPPIKDFGGFNFSQILPDPSKPSKRSFIEDLLFNKVTLADAGFIKQYGDCLGDIAARDLICAQKFNGLTVLPPLLTDEMIAQYTSALLAGTITSGWTFGAGAALQIPFAMQMAYRFNGIGVTQNVLYENQKLIANQFNSAIGKIQDSLSSTASALGKLQDVVNQNAQALNTLVKQLSSNFGAISSVLNDILSRLDGGEAEVQIDRLITGRLQSLQTYVTQQLIRAAEIRASANLAATKMSECVLGQSKRVDFCGKGYHLMSFPQSAPHGVVFLHVTYVPAQEKNFTTAPAICHDGKAHFPREGVFVSNGTHWFVTQRNFYEPQIITTDNTFVSGNCDVVIGIVNNTVYDPL QPELDSFK AS GGGGSDIIKLLNEQVNKEMQSSNLYMSMSSWCYTHSLDGAGLFLFDHAAEEYEHAKKLIIFLNENNVPVQLTSISAPEHKFEGLTQIFQKAYEHEQHISESINNIVDHAIKSKDHATFNFLQWYVAEQHEEEVLFKDILDKIELIGNENHGLYLADQYVKGIAKSRKS

V. Polynucleotides and Expression Constructs

The stabilized coronavirus soluble S immunogen proteins and the relatedvaccine compositions of the invention are typically produced by firstgenerating expression constructs (i.e., expression vectors) that containoperably linked coding sequences of the various structural componentsdescribed herein. Accordingly, in some related aspects, the inventionprovides substantially purified polynucleotides (DNA or RNA) that encodethe immunogens or nanoparticle displayed immunogens as described herein.Some polynucleotides of the invention encode one of the engineered spikeimmunogen polypeptides described herein, e.g., stabilized SARS-COV-2soluble S immunogens shown in SEQ ID NOs:32-37. Some polynucleotides ofthe invention encode the subunit sequence of one of the nanoparticlescaffolded vaccines described herein, e.g., the fusion protein sequencesshown in SEQ ID NOs:38-40. While the expressed spike immunogenpolypeptides of the invention typically do not contain the N-terminalleader sequence, some of the polynucleotide sequences of the inventionadditionally encode the leader sequence of the native spike protein.Thus, for example, polynucleotides encoding engineered SARS-COV-2 spikeimmunogen polypeptides (e.g., SEQ ID NOs:33-37) or the nanoparticlescaffolded polypeptide sequences (e.g., SEQ ID NO:38-40) canadditionally encode the native leader sequence shown in SEQ ID NO:15, ora substantially identical or conservatively modified variant sequence.

Also provided in the invention are expression vectors that harbor suchpolynucleotides (e.g., CMV vectors exemplified herein) and host cellsfor producing the vaccine immunogens (e.g., HEK293F, ExpiCHO, and CHO-Scell lines exemplified herein). The fusion polypeptides encoded by thepolynucleotides or expressed from the vectors are also included in theinvention. As described herein, the nanoparticle subunit fused soluble Simmunogen polypeptides will self-assemble into nanoparticle vaccinesthat display the immunogen polypeptides or proteins on its surface.

The polynucleotides and related vectors can be readily generated withstandard molecular biology techniques or the protocols exemplifiedherein. For example, general protocols for cloning, transfecting,transient gene expression and obtaining stable transfected cell linesare described in the art, e.g., Sambrook et al., Molecular Cloning: ALaboratory Manual, Cold Spring Harbor Press, N.Y., (3^(rd) ed., 2000);and Brent et al., Current Protocols in Molecular Biology, John Wiley &Sons, Inc. (ringbou ed., 2003). Introducing mutations to apolynucleotide sequence by PCR can be performed as described in, e.g.,PCR Technology: Principles and Applications for DNA Amplification, H. A.Erlich (Ed.), Freeman Press, NY, NY, 1992; PCR Protocols: A Guide toMethods and Applications, Innis et al. (Ed.), Academic Press, San Diego,CA, 1990; Manila et al., Nucleic Acids Res. 19:967, 1991; and Eckert etal., PCR Methods and Applications 1:17, 1991.

The selection of a particular vector depends upon the intended use ofthe fusion polypeptides. For example, the selected vector must becapable of driving expression of the fusion polypeptide in the desiredcell type, whether that cell type be prokaryotic or eukaryotic. Manyvectors contain sequences allowing both prokaryotic vector replicationand eukaryotic expression of operably linked gene sequences. Vectorsuseful for the invention may be autonomously replicating, that is, thevector exists extrachromosomally and its replication is not necessarilydirectly linked to the replication of the host cell's genome.Alternatively, the replication of the vector may be linked to thereplication of the host's chromosomal DNA, for example, the vector maybe integrated into the chromosome of the host cell as achieved byretroviral vectors and in stably transfected cell lines. Bothviral-based and nonviral expression vectors can be used to produce theimmunogens in a mammalian host cell. Nonviral vectors and systemsinclude plasmids, episomal vectors, typically with an expressioncassette for expressing a protein or RNA, and human artificialchromosomes (see, e.g., Harrington et al., Nat. Genet. 15:345, 1997).Useful viral vectors include vectors based on lentiviruses or otherretroviruses, adenoviruses, adenoassociated viruses, Cytomegalovirus,herpes viruses, vectors based on SV40, papilloma virus, HBP Epstein Barrvirus, vaccinia virus vectors and Semliki Forest virus (SFV). See, Brentet al., supra; Smith, Annu. Rev. Microbiol. 49:807, 1995; and Rosenfeldet al., Cell 68:143, 1992.

Depending on the specific vector used for expressing the fusionpolypeptide, various known cells or cell lines can be employed in thepractice of the invention. The host cell can be any cell into whichrecombinant vectors carrying a fusion of the invention may be introducedand wherein the vectors are permitted to drive the expression of thefusion polypeptide is useful for the invention. It may be prokaryotic,such as any of a number of bacterial strains, or may be eukaryotic, suchas yeast or other fungal cells, insect or amphibian cells, or mammaliancells including, for example, rodent, simian or human cells. Cellsexpressing the fusion polypeptides of the invention may be primarycultured cells or may be an established cell line. Thus, in addition tothe cell lines exemplified herein (e.g., CHO cells), a number of otherhost cell lines capable well known in the art may also be used in thepractice of the invention. These include, e.g., various Cos cell lines,HeLa cells, Sf9 cells, HEK293, AtT20, BV2, and N18 cells, myeloma celllines, transformed B-cells and hybridomas.

The use of mammalian tissue cell culture to express polypeptides isdiscussed generally in, e.g., Winnacker, From Genes to Clones, VCHPublishers, N.Y., N.Y., 1987. The fusion polypeptide-expressing vectorsmay be introduced to the selected host cells by any of a number ofsuitable methods known to those skilled in the art. For the introductionof fusion polypeptide-encoding vectors to mammalian cells, the methodused will depend upon the form of the vector. For plasmid vectors, DNAencoding the fusion polypeptide sequences may be introduced by any of anumber of transfection methods, including, for example, lipid-mediatedtransfection (“lipofection”), DEAE-dextran-mediated transfection,electroporation or calcium phosphate precipitation. These methods aredetailed, for example, in Brent et al., supra. Lipofection reagents andmethods suitable for transient transfection of a wide variety oftransformed and non-transformed or primary cells are widely available,making lipofection an attractive method of introducing constructs toeukaryotic, and particularly mammalian cells in culture. For example,LipofectAMINE™ (Life Technologies) or LipoTaxi™ (Stratagene) kits areavailable. Other companies offering reagents and methods for lipofectioninclude Bio-Rad Laboratories, CLONTECH, Glen Research, LifeTechnologies, JBL Scientific, MBI Fermentas, PanVera, Promega, QuantumBiotechnologies, Sigma-Aldrich, and Wako Chemicals USA.

For long-term, high-yield production of recombinant fusion polypeptides,stable expression is preferred. Rather than using expression vectorswhich contain viral origins of replication, host cells can betransformed with the fusion polypeptide-encoding sequences controlled byappropriate expression control elements (e.g., promoter, enhancer,sequences, transcription terminators, polyadenylation sites, etc.), andselectable markers. The selectable marker in the recombinant vectorconfers resistance to the selection and allows cells to stably integratethe vector into their chromosomes. Commonly used selectable markersinclude neo, which confers resistance to the aminoglycoside G-418(Colberre-Garapin, et al., J. Mol. Biol., 150:1, 1981); and hygro, whichconfers resistance to hygromycin (Santerre et al., Gene, 30: 147, 1984).Through appropriate selections, the transfected cells can containintegrated copies of the fusion polypeptide encoding sequence.

VI. Pharmaceutical Compositions and Therapeutic Applications

In another aspect, the invention provides pharmaceutical compositionsand related therapeutic methods of using the redesigned coronavirus Simmunogens and nanoparticle vaccine compositions as described herein. Invarious embodiments, the pharmaceutical compositions can contain theengineered viral spike proteins or RBD polypeptides, nanoparticlescaffolded viral spike immunogens, as well as polynucleotide sequencesor vectors encoding the engineered viral spike immunogens ornanoparticle vaccines described herein. In some embodiments, the solubleS trimer immunogen for the different viruses (e.g., SARS-COV-2) can beused for preventing and treating the corresponding viral infections. Invarious other embodiments, the nanoparticle vaccines containingdifferent viral or non-viral immunogens described herein can be employedto prevent or treat the corresponding diseases, e.g., infections causedby the various coronaviruses. Some embodiments of the invention relateto use of the SARS-COV-2 immunogens or vaccines for preventing ortreating SARS-COV-2 infections in human subjects. Some embodiments ofthe invention relate to use of the SARS-CoV immunogens or vaccines forpreventing or treating SARS-CoV infections. Some embodiments of theinvention relate to use of the MERS-CoV immunogens or vaccines forpreventing or treating MERS-CoV infections.

In the practice of the various therapeutic methods of the invention, thesubjects in need of prevention or treatment of a disease or condition(e.g., SARS-COV-2 infection) is administered with the correspondingnanoparticle vaccine, the immunogen protein or polypeptide, or anencoding polynucleotide described herein. Typically, the nanoparticlevaccine, the immunogen protein or the encoding polynucleotide disclosedherein is included in a pharmaceutical composition. The pharmaceuticalcomposition can be either a therapeutic formulation or a prophylacticformulation. Typically, the composition can additionally include one ormore pharmaceutically acceptable vehicles and, optionally, othertherapeutic ingredients (for example, antiviral drugs). Variouspharmaceutically acceptable additives can also be used in thecompositions.

Thus, some of the pharmaceutical compositions of the invention arevaccine compositions. For vaccine compositions, appropriate adjuvantscan be additionally included. Examples of suitable adjuvants include,e.g., aluminum hydroxide, lecithin, Freund's adjuvant, MPL™ and IL-12.In some embodiments, the vaccine compositions or nanoparticle immunogensdisclosed herein (e.g., SARS-COV-2 vaccine composition) can beformulated as a controlled-release or time-release formulation. This canbe achieved in a composition that contains a slow release polymer or viaa microencapsulated delivery system or bioadhesive gel. The variouspharmaceutical compositions can be prepared in accordance with standardprocedures well known in the art. See, e.g., Remington's PharmaceuticalSciences, 19^(th) Ed., Mack Publishing Company, Easton, Pa., 1995;Sustained and Controlled Release Drug Delivery Systems, J. R. Robinson,ed., Marcel Dekker, Inc., New York, 1978); U.S. Pat. Nos. 4,652,441 and4,917,893; 4,677,191 and 4,728,721; and 4,675,189.

The pharmaceutical compositions of the invention can be readily employedin a variety of therapeutic or prophylactic applications, e.g., fortreating SARS-COV-2 infection or eliciting an immune response toSARS-COV-2 in a subject. In various embodiments, the vaccinecompositions can be used for treating or preventing infections caused bya pathogen from which the displayed immunogen polypeptide in thenanoparticle vaccine is derived. Thus, the vaccine compositions of theinvention can be used in diverse clinical settings for treating orpreventing infections caused by various viruses. As exemplification, aSARS-COV-2 nanoparticle vaccine composition can be administered to asubject to induce an immune response to SARS-COV-2, e.g., to induceproduction of broadly neutralizing antibodies to the virus. For subjectsat risk of developing an SARS-COV-2 infection, a vaccine composition ofthe invention can be administered to provide prophylactic protectionagainst viral infection. Therapeutic and prophylactic applications ofvaccines derived from the other immunogens described herein can besimilarly performed. Depending on the specific subject and conditions,pharmaceutical compositions of the invention can be administered tosubjects by a variety of administration modes known to the person ofordinary skill in the art, for example, intramuscular, subcutaneous,intravenous, intra-arterial, intra-articular, intraperitoneal, orparenteral routes.

In general, the pharmaceutical composition is administered to a subjectin need of such treatment for a time and under conditions sufficient toprevent, inhibit, and/or ameliorate a selected disease or condition orone or more symptom(s) thereof. In various embodiments, the therapeuticmethods of the invention relate to methods of blocking the entry of acoronavirus (e.g., SARS-CoV, SARS-CoV-2, or MERS-CoV) into a host cell,e.g., a human host cell, methods of preventing the S protein of acoronavirus from binding the host receptor, and methods of treatingacute respiratory distress that is often associated with coronavirusinfections. In some embodiments, the therapeutic methods andcompositions described herein can be employed in combination with otherknown therapeutic agents and/or modalities useful for treating orpreventing coronavirus infections. The known therapeutic agents and/ormodalities include, e.g., a nuclease analog or a protease inhibitor(e.g., remdesivir), monoclonal antibodies directed against one or morecoronaviruses, an immunosuppressant or anti-inflammatory drug (e.g.,sarilumab or tocilizumab), ACE inhibitors, vasodilators, or anycombination thereof.

For therapeutic applications, the compositions should contain atherapeutically effective amount of the nanoparticle immunogen describedherein. For prophylactic applications, the compositions should contain aprophylactically effective amount of the nanoparticle immunogendescribed herein. The appropriate amount of the immunogen can bedetermined based on the specific disease or condition to be treated orprevented, severity, age of the subject, and other personal attributesof the specific subject (e.g., the general state of the subject's healthand the robustness of the subject's immune system). Determination ofeffective dosages is additionally guided with animal model studiesfollowed up by human clinical trials and is guided by administrationprotocols that significantly reduce the occurrence or severity oftargeted disease symptoms or conditions in the subject.

For prophylactic applications, the immunogenic composition is providedin advance of any symptom, for example in advance of infection. Theprophylactic administration of the immunogenic compositions serves toprevent or ameliorate any subsequent infection. Thus, in someembodiments, a subject to be treated is one who has, or is at risk fordeveloping, an infection (e.g., SARS-COV-2 infection), for examplebecause of exposure or the possibility of exposure to the virus (e.g.,SARS-COV-2). Following administration of a therapeutically effectiveamount of the disclosed therapeutic compositions, the subject can bemonitored for an infection (e.g., SARS-COV-2 infection), symptomsassociated with an infection (e.g., SARS-COV-2 infection), or both.

For therapeutic applications, the immunogenic composition is provided ator after the onset of a symptom of disease or infection, for exampleafter development of a symptom of infection (e.g., SARS-COV-2infection), or after diagnosis of the infection. The immunogeniccomposition can thus be provided prior to the anticipated exposure tothe virus so as to attenuate the anticipated severity, duration orextent of an infection and/or associated disease symptoms, afterexposure or suspected exposure to the virus, or after the actualinitiation of an infection. The pharmaceutical composition of theinvention can be combined with other agents known in the art fortreating or preventing infections by a relevant pathogen (e.g.,SARS-COV-2 infection).

The nanoparticle vaccine compositions containing novel structuralcomponents as described in the invention (e.g., SARS-COV-2 vaccine) orpharmaceutical compositions of the invention can be provided ascomponents of a kit. Optionally, such a kit includes additionalcomponents including packaging, instructions and various other reagents,such as buffers, substrates, antibodies or ligands, such as controlantibodies or ligands, and detection reagents. An optional instructionsheet can be additionally provided in the kits.

EXAMPLES

The following examples are offered to illustrate, but not to limit thepresent invention.

Example 1 S Antigen Stabilization, Production, and Purification

This Example describes redesigned stable and soluble coronavirus Strimers:

I. SARS-CoV

The sequence of SARS-CoV S protein was obtained from GenBank with the IDNP_828851. The numbering is based on the UniProt definition with UniProID P59594. The soluble S construct is defined as M1-Q1190. Q1190 isimmediately upstream of the predicted transmembrane region that startswith the ¹¹⁹¹YIK¹¹⁹³ motif A truncated soluble S construct is defined asM1-K1131, which is devoid of HR2. The HR2 deletion will disrupt theHR1/HR2 fusion core and stabilize the prefusion S structure. The Sconstruct can be further truncated at Y1120 with a 3-residue “GNS” motif(from MERS-CoV S) added to Y1120. This modification will increaseprotein yield significantly when displayed on nanoparticles.

Uncleaved, prefusion-optimized (UFO) S constructs can be obtained by (a)adding a R667G mutation and a K968P/V969P (or K968G/V969G) doublemutation between the HR1 and the central helix (CH). While the R667Gmutation aims to remove the S1/cleavage site, the K968P/V969P doublemutation has been shown to stabilize the prefusion S structure. Insteadof rigidifying the HR1-turn-CH, the K968G/V969G double mutation aims toremove any strain in the turn region and as a result to stabilize theprefusion S structure.

The UFO S constructs described above can be further stabilized byintroducing a proline mutation (S924P, T925P, or A926P), a glycinemutation (S924G, T925G, or A926G), or their combinations to the HR1region that interacts with HR2 to form a fusion core. These mutationsfunction to disrupt the six-helix-bundle fusion core and destabilize thepostfusion S. Other mutations such as inserting one or two residues(e.g. G or GS) in the region S924-A926 to disrupt the helical patterncan also destabilize the postfusion S and prevent conformational change.

Trimerization motifs such as T4 fibritin foldon (PDB ID: 4NCV) and viralcapsid protein SHP (PDB: 1TD0) can be further added to the C-terminus ofa redesigned S construct described above with a short GS linker inbetween to stabilize the trimer. In addition, an His6-tag can be addedto the C-terminus of the trimerization motif to facilitate proteinpurification using a Nickel column.

The C-terminus of the redesigned SARS-CoV UFO S constructs can be fusedto the N-terminus of a nanoparticle-forming subunit (ferritin 24-mer andtwo 60-mers, E2p and I3-01) so that the fusion construct, when expressedin appropriate cell lines, can self-assemble into nanoparticles withprefusion S trimers displayed on the nanoparticle surface.

SARS-CoV soluble S sequence (SEQ ID NO:1):

MFIFLLFLTLTSGSDLDRCTTFDDVQAPNYTQHTSSMRGVYYPDEIFRSDTLYLTQDLFLPFYSNVTGFHTINHTFGNPVIPFKDGIYFAATEKSNVVRGWVFGSTMNNKSQSVIIINNSTNVVIRACNFELCDNPFFAVSKPMGTQTHTMIFDNAFNCTFEYISDAFSLDVSEKSGNFKHLREFVFKNKDGFLYVYKGYQPIDVVRDLPSGFNTLKPIFKLPLGINITNFRAILTAFSPAQDIWGTSAAAYFVGYLKPTTFMLKYDENGTITDAVDCSQNPLAELKCSVKSFEIDKGIYQTSNFRVVPSGDVVRFPNITNLCPFGEVFNATKFPSVYAWERKKISNCVADYSVLYNSTFFSTFKCYGVSATKLNDLCFSNVYADSFVVKGDDVRQIAPGQTGVIADYNYKLPDDFMGCVLAWNTRNIDATSTGNYNYKYRYLRHGKLRPFERDISNVPFSPDGKPCTPPALNCYWPLNDYGFYTTTGIGYQPYRVVVLSFELLNAPATVCGPKLSTDLIKNQCVNFNFNGLTGTGVLTPSSKRFQPFQQFGRDVSDFTDSVRDPKTSEILDISPCAFGGVSVITPGTNASSEVAVLYQDVNCTDVSTAIHADQLTPAWRIYSTGNNVFQTQAGCLIGAEHVDTSYECDIPIGAGICASYHTVSLLRSTSQKSIVAYTMSLGADSSIAYSNNTIAIPTNFSISITTEVMPVSMAKTSVDCNMYICGDSTECANLLLQYGSFCTQLNRALSGIAAEQDRNTREVFAQVKQMYKTPTLKYFGGFNFSQILPDPLKPTKRSFIEDLLFNKVTLADAGFMKQYGECLGDINARDLICAQKFNGLTVLPPLLTDDMIAAYTAALVSGTATAGWTFGAGAALQIPFAMQMAYRFNGIGVTQNVLYENQKQIANQFNKAISQIQESLTTTSTALGKLQDVVNQNAQALNTLVKQLSSNFGAISSVLNDILSRLDKVEAEVQIDRLITGRLQSLQTYVTQQLIRAAEIRASANLAATKMSECVLGQSKRVDFCGKGYHLMSFPQAAPHGVVFLHVTYVPSQERNFTTAPAICHEGKAYFPREGVFVFNGTSWFITQRNFFSPQIITTDNTFVSGNCDVVIGIINNTVYDPLQPELDSFKEELDKYFKNHTSPDVDLGDISGINASVVNIQKEIDRLNEVAKNLN ESLIDLQELGKYEQ

SARS-CoV soluble S sequence minus N-terminal leader (SEQ ID NO:7):residues 14-1190 of SEQ ID NO:1.

Leader sequence (SEQ ID NO:8): MFIFLLFLTLTSG (residues 1-13 of SEQ IDNO:1).

HR2 sequence (SEQ ID NO:9): residues 1132-1190 of SEQ ID NO:1EELDKYFKNHTSPDVDLGDISGINASVVNIQKEIDRLNEVAKNLNESLIDLQELG KYEQ

Further truncated C-terminal sequence (SEQ ID NO:10): DPLQPELDSFK(residues 1121-1131 of SEQ ID NO:1).

II. MERS-CoV

The sequence of MERS-CoV S protein was obtained from GenBank with the IDJX869059.2. The amino acid numbering is based on the UniProt definitionwith UniPro ID R9UQ53.

The soluble S construct is defined as M1-Y1291. Y1291 is immediatelyupstream of the predicted transmembrane region that starts with the¹²⁹²YNK¹²⁹⁴ motif. A truncated soluble S construct is defined asM1-S1226, which is devoid of HR2. The HR2 deletion will disrupt theHR1/HR2 fusion core and stabilize the prefusion S structure.

Uncleaved, prefusion-optimized (UFO) S constructs can be obtained byadding a R748A/R751G double mutation and a V1060P/L1061P (orV1060G/L1061G) double mutation. While the R748A/R751G double mutationaims to remove the S1/S2 cleavage site, the V1060P/L1061P doublemutation has been shown to stabilize the prefusion S structure. Insteadof rigidifying the HR1-turn-CH, the V1060G/L1061G double mutation aimsto remove any strain in the turn region and as a result to stabilize theprefusion S structure.

The UFO S constructs can be further stabilized by introducing a prolinemutation (T1013P, T1014P, or T1015P), a glycine mutation (T1013G,T1014G, or T1015G), or their combinations to the HR1 region thatinteracts with HR2 to form a fusion core. These mutations will disruptthe six-helix-bundle fusion core and destabilize the postfusion S. Othermutations such as inserting one or two residues (e.g. G or GS) in theregion T1013-T1015 to disrupt the helical pattern will also destabilizethe postfusion S and prevent conformational change.

Trimerization motifs such as T4 fibritin foldon (PDB ID: 4NCV) and viralcapsid protein SHP (PDB: 1TD0) can be added to the C-terminus of theredesigned UFO S constructs described above with a short GS linker inbetween to stabilize the trimer. An His6-tag can be added to theC-terminus of the trimerization motif to facilitate protein purificationby a Nickel column.

The C-terminus of the redesigned MERS-CoV UFO S constructs can be fusedto the N-terminus of a nanoparticle-forming subunit so that the fusionconstruct, when expressed in appropriate cell lines, can self-assembleinto nanoparticles with prefusion S trimers displayed on thenanoparticle surface.

MERS-CoV soluble S (SEQ ID NO:2):

MIHSVFLLMFLLTPTESYVDVGPDSVKSACIEVDIQQTFFDKTWPRPIDVSKADGITYPQGRTYSNITITYQGLFPYQGDHGDMYVYSAGHATGTTPQKLFVANYSQDVKQFANGFVVRIGAAANSTGTVIISPSTSATIRKIYPAFMLGSSVGNFSDGKMGRFFNHTLVLLPDGCGTLLRAFYCILEPRSGNHCPAGNSYTSFATYHTPATDCSDGNYNRNASLNSFKEYFNLRNCTFMYTYNITEDEILEWFGITQTAQGVHLFSSRYVDLYGGNMFQFATLPVYDTIKYYSIIPHSIRSIQSDRKAWAAFYVYKLQPLTFLLDFSVDGYIRRAIDCGFNDLSQLHCSYESFDVESGVYSVSSFEAKPSGSVVEQAEGVECDFSPLLSGTPPQVYNFKRLVFTNCNYNLTKLLSLFSVNDFTCSQISPAAIASNCYSSLILDYFSYPLSMKSDLSVSSAGPISQFNYKQSFSNPTCLILATVPHNLTTITKPLKYSYINKCSRLLSDDRTEVPQLVNANQYSPCVSIVPSTVWEDGDYYRKQLSPLEGGGWLVASGSTVAMTEQLQMGFGITVQYGTDTNSVCPKLEFANDTKIASQLGNCVEYSLYGVSGRGVFQNCTAVGVRQQRFVYDAYQNLVGYYSDDGNYYCLRACVSVPVSVIYDKETKTHATLFGSVACEHISSTMSQYSRSTRSMLKRRDSTYGPLQTPVGCVLGLVNSSLFVEDCKLPLGQSLCALPDTPSTLTPRSVRSVPGEMRLASIAFNHPIQVDQLNSSYFKLSIPTNFSFGVTQEYIQTTIQKVTVDCKQYVCNGFQKCEQLLREYGQFCSKINQALHGANLRQDDSVRNLFASVKSSQSSPIIPGFGGDFNLTLLEPVSISTGSRSARSAIEDLLFDKVTIADPGYMQGYDDCMQQGPASARDLICAQYVAGYKVLPPLMDVNMEAAYTSSLLGSIAGVGWTAGLSSFAAIPFAQSIFYRLNGVGITQQVLSENQKLIANKFNQALGAMQTGFTTTNEAFQKVQDAVNNNAQALSKLASELSNTFGAISASIGDIIQRLDVLEQDAQIDRLINGRLTTLNAFVAQQLVRSESAALSAQLAKDKVNECVKAQSKRSGFCGQGTHIVSFVVNAPNGLYFMHVGYYPSNHIEVVSAYGLCDAANPTNCIAPVNGYFIKTNNTRIVDEWSYTGSSFYAPEPITSLNTKYVAPQVTYQNISTNLPPPLLGNSTGIDFQDELDEFFKNVSTSIPNFGSLTQINTTLLDLTYEMLSLQQVVK ALNESYIDLKELGNYTY

MERS-CoV soluble S sequence minus N-terminal leader (SEQ ID NO:11):residues 18-1291 of SEQ ID NO:2.

Leader sequence (SEQ ID NO:12): MIHSVFLLMFLLTPTES (residues 1-17 of SEQID NO:2).

HR2 sequence (SEQ ID NO:13): residues 1227-1291 of SEQ ID NO:2TGIDFQDELDEFFKNVSTSIPNFGSLTQINTTLLDLTYEMLSLQQVVKALNESYI DLKELGNYTY

III. SARS-CoV-2

The sequence of SARS-CoV-2 S protein was obtained from GenBank with theID MN908947.3. The amino acid numbering is based on the cryo-EM modelwith PDB ID 6VSB.

The soluble S construct is defined as M1-Q1208. Q1208 is immediatelyupstream of the predicted transmembrane region that starts with the¹²⁰⁹YIK¹²¹¹ motif. A truncated soluble S construct is defined asM1-K1149, which is devoid of HR2. The HR2 deletion will disrupt theHR1/HR2 fusion core and stabilize the prefusion S structure. The Sconstruct can be further truncated at Y1138 with a 3-residue “GNS” motif(from MERS-CoV S) added to Y1138. This modification will increaseprotein yield significantly when displayed on nanoparticles.

The uncleaved, prefusion-optimized (UFO) soluble S construct is definedas M1-Q1208 with the modified S1/S2 cleavage site ⁶⁸²GSAGSV⁶⁸⁷ (SEQ IDNO:18) and a K986P/V987P (or K986G/V987G) double mutation. SARS-CoV-2has a 4-aa insertion prior to the S1/S2 cleavage site, ⁶⁸¹PRRA⁶⁸⁴, whichwill enhance the cleavage efficiency. The modification ⁶⁸²GSAGSV⁶⁸⁷ (SEQID NO:18) aims to remove the S1/S2 cleavage site, and the K986P/V987Pdouble mutation has been shown to stabilize the prefusion S structure.Instead of rigidifying the HR1-turn-CH, the K986G/V987G double mutationaims to remove any strain in the turn region and as a result tostabilize the prefusion S structure.

The SARS-CoV-2 UFO S constructs in (b) can be further stabilized byintroducing a proline mutation (A942P, S943P, and A944P), a glycinemutation (A942G, S943G, and A944G), or their combinations to the HR1region that interacts with HR2 to form a fusion core. These mutationswill disrupt the six-helix-bundle fusion core and destabilize thepostfusion S. Other mutations such as inserting one or two residues(e.g. G or GS) in the region A942-A944 to disrupt the helical patternwill also destabilize the postfusion S and prevent conformationalchange.

Trimerization motifs such as T4 fibritin foldon (PDB ID: 4NCV) and viralcapsid protein SHP (PDB: 1TD0) can be added to the C-terminus of aredesigned S construct in (b) and (c) with a short GS linker in betweento stabilize the trimer. An His6-tag can be added to the C-terminus ofthe trimerization motif to facilitate protein purification by a Nickelcolumn.

The C-terminus of the redesigned SARS-CoV-2 UFO S construct describedabove can be fused to the N-terminus of a nanoparticle-forming subunitso that the fusion construct, when expressed in appropriate cell lines,can self-assemble into nanoparticles with prefusion S trimers displayedon the nanoparticle surface.

SARS-CoV-2 soluble S (SEQ ID NO:3):

MFVFLVLLPLVSSQCVNLTTRTQLPPAYTNSFTRGVYYPDKVFRSSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDCALDPLSETKCTLKSFTVEKGIYQTSNFRVQPTESIVRFPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPKKSTNLVKNKCVNFNFNGLTGTGVLTESNKKFLPFQQFGRDIADTTDAVRDPQTLEILDITPCSFGGVSVITPGTNTSNQVAVLYQDVNCTEVPVAIHADQLTPTWRVYSTGSNVFQTRAGCLIGAEHVNNSYECDIPIGAGICASYQTQTNSPRRARSVASQSHAYTMSLGAENSVAYSNNSIAIPTNFTISVTTEILPVSMTKTSVDCTMYICGDSTECSNLLLQYGSFCTQLNRALTGIAVEQDKNTQEVFAQVKQIYKTPPIKDFGGFNFSQILPDPSKPSKRSFIEDLLFNKVTLADAGFIKQYGDCLGDIAARDLICAQKFNGLTVLPPLLTDEMIAQYTSALLAGTITSGWTFGAGAALQIPFAMQMAYRFNGIGVTQNVLYENQKLIANQFNSAIGKIQDSLSSTASALGKLQDVVNQNAQALNTLVKQLSSNFGAISSVLNDILSRLDKVEAEVQIDRLITGRLQSLQTYVTQQLIRAAEIRASANLAATKMSECVLGQSKRVDFCGKGYHLMSFPQSAPHGVVFLHVTYVPAQEKNFTTAPAICHDGKAHFPREGVFVSNGTHWFVTQRNFYEPQIITTDNTFVSGNCDVVIGIVNNTVYDPLQPELDSFKEELDKYFKNHTSPDVDLGDISGINASVVNIQKEIDRLNEVAKNLNESLIDLQELGKYEQ

SARS-CoV-2 soluble S sequence minus N-terminal leader (SEQ ID NO:14):residues 14-1208 of SEQ ID NO:3.

Leader sequence (SEQ ID NO:15): MFVFLVLLPLVSS (residues 1-13 of SEQ IDNO:3).

HR2 sequence (SEQ ID NO:9): residues 1150-1208 of SEQ ID NO:3EELDKYFKNHTSPDVDLGDISGINASVVNIQKEIDRLNEVAKNLNESLIDLQELG KYEQ

Further truncated C-terminal sequence (SEQ ID NO:10): DPLQPELDSFK(residues 1121-1131 of SEQ ID NO:3)

Example 2 Designed RBD Domains of Coronaviruses I. SARS-CoV RBD BasedVaccines

The sequence of SARS-CoV S protein and amino acid numbering are notedabove. The SARS-CoV RBD used in RBD-based vaccine design is defined asP317-D518 (see SEQ ID NO:4). Specifically, a trimerization motif, theviral capsid protein SHP (PDB: 1TD0), can be added to the C-terminus ofSARS-CoV RBD with a short 5GS linker in between to stabilize RBD in atrimeric conformation. A His6-tag can be added to the C-terminus of thetrimerization motif with a 1GS linker to facilitate purification.

SpyTag and SpyCatcher can be attached to SARS-CoV RBD and a nanoparticlesubunit in different combinations to facilitate the multivalent displayof RBD on nanoparticle. For example, if the C-terminus of RBD is fusedto the N-terminus of SpyTag with a 5GS linker, the C-terminus ofSpyCatcher can be fused to the N-terminus of a nanoparticle subunit witha 5GS linker to create a pair. SpyTag and SpyCatcher can be switched inthese two constructs to create a different pair. SpyTag or SpyCatchercan also be fused to the N-terminus of RBD with a 5GS linker. When thetwo constructs are introduced into and expressed in the host cells, arecombinant vaccine protein will be formed through the binding betweenthe SpyTag and SpyCatcher motifs.

SARS-CoV RBD (SEQ ID NO:4)

PNITNLCPFGEVFNATKFPSVYAWERKKISNCVADYSVLYNSTFFSTFKCYGVSATKLNDLCFSNVYADSFVVKGDDVRQIAPGQTGVIADYNYKLPDDFMGCVLAWNTRNIDATSTGNYNYKYRYLRHGKLRPFERDISNVPFSPDGKPCTPPALNCYWPLNDYGFYTTTGIGYQPYRVVVLSFELLNAPATVCG PKLSTD

SpyTag: VPTIVMVDAYKRYK (SEQ ID NO:16).

SpyCatcher: SEQ ID NO:17

AMVTTLSGLSGEQGPSGDMTTEEDSATHIKFSKRDEDGRELAGATMELRDSSGKTISTWISDGHVKDFYLYPGKYTFVETAAPDGYEVATAITFTVNE QGQVTVNGEATKGDAHTAS

II. MERS-CoV RBD Based Vaccines

The sequence of MERS-CoV S protein and amino acid numbering are notedabove. The MERS-CoV RBD used in RBD-based vaccine design is defined asE382-K587 (see SEQ ID NO:5). A trimerization motif, the viral capsidprotein SHP (PDB: 1TD0), can be added to the C-terminus of MERS-CoV RBDwith a short 5GS linker in between to stabilize RBD in a trimericconformation. A His6-tag can be added to the C-terminus of thetrimerization motif with a 1GS linker to facilitate purification.

SpyTag and SpyCatcher can be attached to MERS-CoV RBD and a nanoparticlesubunit in different combinations to facilitate the multivalent displayof RBD on nanoparticle. For example, if the C-terminus of RBD is fusedto the N-terminus of SpyTag with a 5GS linker, the C-terminus ofSpyCatcher can be fused to the N-terminus of a nanoparticle subunit witha 5GS linker to create a pair. SpyTag and SpyCatcher can be switched inthese two constructs to create a different pair. SpyTag or SpyCatchercan also be fused to the N-terminus of RBD with a 5GS linker. When thetwo constructs are introduced into and expressed in the host cells, arecombinant vaccine protein will be formed through the binding betweenthe SpyTag and SpyCatcher motifs.

MERS-CoV RBD (SEQ ID NO:5)

ECDFSPLLSGTPPQVYNFKRLVFTNCNYNLTKLLSLFSVNDFTCSQISPAAIASNCYSSLILDYFSYPLSMKSDLSVSSAGPISQFNYKQSFSNPTCLILATVPHNLTTITKPLKYSYINKCSRLLSDDRTEVPQLVNANQYSPCVSIVPSTVWEDGDYYRKQLSPLEGGGWLVASGSTVAMTEQLQMGFGITVQY GTDTNSVCPK

SpyTag: VPTIVMVDAYKRYK (SEQ ID NO:16).

SpyCatcher: SEQ ID NO:17:

AMVTTLSGLSGEQGPSGDMTTEEDSATHIKFSKRDEDGRELAGATMELRDSSGKTISTWISDGHVKDFYLYPGKYTFVETAAPDGYEVATAITFTVNE QGQVTVNGEATKGDAHTAS

III. SARS-CoV-2 RBD Based Vaccines

The sequence of SARS-CoV-2 S protein and amino acid numbering are notedabove. The SARS-CoV-2 RBD used in RBD-based vaccine design is defined asP330-N532 (see SEQ ID NO:6). A trimerization motif, the viral capsidprotein SHP (PDB: 1TD0), can be added to the C-terminus of SARS-CoV-2RBD with a short 5GS linker in between to stabilize RBD in a trimericconformation. A His6-tag can be added to the C-terminus of thetrimerization motif with a 1GS linker to facilitate purification.

SpyTag and SpyCatcher can be attached to SARS-CoV-2 RBD and ananoparticle subunit in different combinations to facilitate themultivalent display of RBD on nanoparticle. For example, if theC-terminus of RBD is fused to the N-terminus of SpyTag with a 5GSlinker, the C-terminus of SpyCatcher can be fused to the N-terminus of ananoparticle subunit with a 5GS linker to create a pair. SpyTag andSpyCatcher can be switched in these two constructs to create a differentpair. SpyTag or SpyCatcher can also be fused to the N-terminus of RBDwith a 5GS linker. When the two constructs are introduced into andexpressed in the host cells, a recombinant vaccine protein will beformed through the binding between the SpyTag and SpyCatcher motifs.

SARS-CoV-2 RBD (SEQ ID NO:6):

PNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVC GPKKSTN

SpyTag: VPTIVMVDAYKRYK (SEQ ID NO:16).

SpyCatcher: SEQ ID NO:17:

AMVTTLSGLSGEQGPSGDMTTEEDSATHIKFSKRDEDGRELAGATMELRDSSGKTISTWISDGHVKDFYLYPGKYTFVETAAPDGYEVATAITFTVNE QGQVTVNGEATKGDAHTAS

Example 3 Production and Purification of S Trimers and RBD Domains

Cell line: All constructs were expressed in HEK293 F cells and ExpiCHOcells, with ExpiCHO showing significantly higher yield.

Purification: After transient expression, S antigens were purified fromthe supernatant using three methods including the His6-tag/nickel columnand antigen-specific antibody column. The S230 and CR3022 antibodycolumns can be used to purify SARS-CoV S and RBD antigens andnanoparticles. The MCA1 antibody column can be used to purify MERS-CoV Sand RBD antigens and nanoparticles; The CR3022 antibody column can beused to purify SARS-CoV-2 S and RBD antigens and nanoparticles.

Example 4 Rational Design of Scaffolded RBD Trimer and RBD-PresentingSApNPs

We hypothesized that RBD attached to a trimeric scaffold can mimic the“RBD-up” spike conformation and elicit NAbs to block ACE2 binding. Totest this possibility, we designed a fusion construct containingSARS-CoV-1/2 RBD, a short 5-aa G₄S linker (with a 2-aa restrictionsite), and a trimeric viral capsid protein, SHP (PDB: 1TD0). Structuralmodeling showed that the three tethered RBDs form a triangle of 92 Å(measured for L492), which is 14 and 18 Å wider than the SARS-CoV-1“two-RBD-up” spike (PDB: 6CRX, measured for L478) and the MERS-CoV“all-RBD-up” spike (PDB: 5X59, measured for L506), respectively,allowing NAb access to each RBD. We then developed an immunoaffinitychromatography (IAC) column to facilitate tag-free vaccine purification.Previously, NAb-derived IAC columns have been used to purify HIV-1 Envtrimers/NPs, hepatitis C virus (HCV) E2 cores/NPs, and Ebola virus(EBOV) GP trimers/NPs. It was reported that a SARS-CoV-1 NAb, CR3022,can bind SARS-CoV-2 RBD (Tian et al., Emerg. Microbes Infect. 9,382-385, 2020). The SARS-CoV-2 RBD/CR3022 structure revealed the epitopeshared by two SARS-CoVs and alluded to a breathing motion of the spikethat enables CR3022 binding to RBD. Here, we examined the utility ofCR3022 in IAC columns. The SARS-CoV-1/2 RBD-5GS-1TD0 constructs weretransiently expressed in 100-ml ExpiCHO cells and purified on a CR3022antibody column prior to size-exclusion chromatography (SEC) using aSuperdex 200 10/300 GL column. While the SARS-CoV-1 RBD construct showedboth aggregate (8.6 ml) and trimer (12.7 ml) peaks in the SEC profile,the SARS-CoV-2 RBD construct produced a single trimer peak at 12.8 ml.In sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SDS-PAGE),a monomer band of ˜37 kD and a trimer band of ˜100 kD were observedunder reducing and non-reducing conditions, respectively. Antigenicitywas assessed for the two scaffolded RBD trimers in enzyme-linkedimmunosorbent assay (ELISA) after CR3022/SEC purification. RBD-specificNAbs targeting SARS-CoV-1 (CR3022, m396, 80R, and 5230) and SARS-CoV-2(B38, CB6, 5309 from a SARS survivor, and P2B-2F6), were tested inELISA. Overall, similar half maximal effective concentration (EC₅₀)values were observed for the two RBD trimers binding to their respectiveNAbs. The SARS-CoV-1 RBD trimer showed greater binding affinity forCR3002 than its SARS-CoV-2 counterpart with a 1.3-fold difference in theEC₅₀ value. Of the SARS-CoV-2 NAbs, B38 yielded a similar EC₅₀ value toCR3022. The kinetics of antibody binding was measured using biolayerinterferometry (BLI). Overall, all tested antibodies exhibited a faston-rate but with visible differences in their off-rates. For example,B38 showed a faster off-rate than other SARS-CoV-2 NAbs, while CR3022,the antibody used to purify SARS-CoV-1/2 RBD proteins, exhibitedcomparable kinetic profiles.

We then hypothesized that the SpyTag/SpyCatcher (or simply SPY) systemcan be used to conjugate RBD to SApNPs to create multivalent RBDvaccines capable of eliciting a more potent NAb response. The 13-aaSpyTag spontaneously reacts with the SpyCatcher protein to form anirreversible isopeptide bond. The SPY system has been used to attachantigens to SApNPs and VLPs. Here, SpyTag was fused to the C terminus ofRBD, while SpyCatcher was fused to the N terminus of an SApNP subunit,both with a 5-aa G₄S linker. This design was first tested for FR. Wecompared two production strategies—co-expression of RBD-5GS-SpyTag andSpyCatcher-5GS-FR versus supernatant mix after separate expression—andperformed purification on a CR3022 column. Protein obtained fromtransient transfection in 50-ml ExpiCHO cells was analyzed by SEC on aSuperose 6 10/300 GL column. Both production strategies produced a peak(12 ml) corresponding to SApNPs. While the SARS-CoV-2 construct notablyoutperformed its SARS-CoV-1 counterpart in particle yield (0.6-1.0 mgversus 0.3-0.5 mg after CR3022/SEC), supernatant mix appeared to besuperior to co-expression. Nonetheless, the results suggest that bothstrategies can be used to produce RBD-conjugated SApNPs in Goodmanufacturing practice (GMP)-compatible Chinese hamster ovary (CHO)cells. Antigenicity was assessed for SEC-purified RBD-5GS-SPY-5GS-FRSApNPs. In ELISA, RBD-presenting SApNPs showed slightly improved mAbbinding, as indicated by lower EC₅₀ values. In BLI, a more pronouncedeffect of multivalent display on antigenicity was observed, showingnotably increased binding signals and plateaued dissociation.

Structural integrity of various RBD SApNPs was analyzed by negativestain EM. For SARS-CoV-1, an RBD-10GS-FR construct was included forcomparison that produced very few SApNPs. In contrast, theRBD-5GS-SPY-5GS-FR construct produced SApNPs with visible surfacedecorations. For SARS-CoV-2, the purified RBD-5GS-SPY-5GS-FR SApNPs,irrespective of the production strategy, showed morphologiescorresponding to well-formed nanoparticles. Following a similarstrategy, SARS-CoV-1/2 RBDs were also attached to a multilayered I3-01v9SApNP (He et al., bioRxiv, 2020.2008.2022.262634, 2020). Despite themodest yield, large SApNPs were observed in EM.

In summary, we demonstrate the utility of the SPY system for rapiddevelopment of RBD-based SApNP vaccines. Compared to the two-componentRBD SApNPs, the SPY-linked RBD SApNPs presented here may be moreadvantageous in terms of stability and manufacturability.

Example 5 Rational Design of Prefusion Spike Through MinimizingMetastability

It is imperative to understand the SARS-CoV-2 spike metastability, andbased on which, to design the optimal spike as a vaccine antigen. Wefirst created the His-tagged, uncleaved spike ectodomain (S_(ECTO))constructs for SARS-CoV-1/2, both containing the 2P mutation and atrimerization motif (1TD0) fused to the C terminus with a G₄S linker.The two constructs were transiently expressed in 50-ml ExpiCHO cellsfollowed by purification on a Nickel column or a CR3022 column. TheS2P_(ECTO)-5GS-1TD0-His₆ protein was characterized by SEC on a Superose6 10/300 GL column. After Nickel column, both S2P_(ECTO) constructsshowed a trimer peak (˜12 ml) with shoulders to the left and rightindicative of aggregate and dimer/monomer species, respectively. CR3022purification resulted in a consistent trimer peak and less dimer/monomerspecies. We then tested a pair of S_(ECTO) constructs containing adouble glycine mutation (V1060G/L1061G, termed 2G). The 2G mutation hadlittle effect on the SARS-CoV-1 spike but produced an abnormal SECprofile and showed no yield for the SARS-CoV-2 spike after purificationby Nickel and CR3022 columns, respectively. Lastly, we tested a pair ofS2G variants without the HR2 stalk (E1150-Q1208), termed S2GΔHR2.Deletion of the HR2 stalk restored the SARS-CoV-2 trimer peak andreduced aggregates for both SARS-CoVs, as shown by the SEC profiles uponCR3022 purification.

We hypothesized that HR2 may be a key determinant of SARS-CoV spikemetastability. It is possible that the interactions between HR1 and HR2of two neighboring spikes may facilitate the pre-to-post-fusiontransition in addition to ACE2 binding and S1 dissociation. Given theextensive mutations in HR1 (9 in total) compared to SARS-CoV-1, wesought to examine the role of HR1 in SARS-CoV-2 spike metastability withtwo HR1-swapped spike constructs. Interestingly, while HR1 swappingproved ineffective, deletion of the HR2 stalk once again restore thetrimer peak. Therefore, S2GΔHR2 provides a general spike design forSARS-CoV-1/2 and perhaps other CoVs. Four separate production runs ofSARS-CoV-2 S2GΔHR2-5GS-1TD0 in 300-ml ExpiCHO cells resulted in nearlyidentical SEC profiles with a trimer yield of 0.8-1.0 mg. Blue nativepolyacrylamide gel electrophoresis (BN-PAGE) confirmed the purity of theS2GΔHR2 spike across SEC fractions. Antigenicity was assessed forfreshly produced SARS-CoV-2 S2P_(ECTO) and S2GΔHR2 spikes. In ELISA, theS2GΔHR2 spike showed consistently higher affinity for the fiverepresentative mAbs than the S2P_(ECTO) spike. When tested against threenewly identified NAbs, C105 and CC12.1/CC12.3, the two spikes yieldedsimilar EC₅₀ values. In BLI, the S2GΔHR2 spike showed higher bindingsignals than the S2P_(ECTO) spike at the highest concentration, whileexhibiting similar binding kinetics. The use of NAb P2B-2F6 for spikepurification resulted in much higher trimer yield with similar purity tothe CR3022 column across SEC fractions.

Together, we demonstrate that deletion of the HR2 stalk may improvespike properties and S2GΔHR2 may provide a better spike antigen thatimproves on the 2P mutation.

Example 6 Rational Design of Single-Component and Multilayered SApNPs

Although it was proven possible to conjugate trimeric SARS-CoV-2 spikesto an SApNP using the SPY system, the random and irreversible chemicallinking will result in irregular display with unoccupied but spatiallyoccluded anchoring sites on the surface. The SPY system is perhaps moresuitable for individual antigens such as RBD. We therefore set out toobtain rational design of single-component, multilayered,self-assembling spike nanoparticles, using the gene fusion approach.

Native SARS-CoV-2 virions present both pre- and post-fusion spikes onthe surface. Our vaccine strategy aims to develop single-component,multilayered SApNPs that each present 8 or 20 stable S2GΔHR2 spikes tothe immune system. To explore this possibility, we modeled the S2GΔHR2spike on FR with a 5-aa G₄S linker, on E2p with a 5-aa G₄S linker, andon I3-01v9 with a 10-aa (G₄S)₂ linker, resulting in large SApNPs withdiameters of 47.9 nm, 55.9 nm, and 59.3 nm, respectively. The threeS2GΔHR2 SApNP constructs were transiently expressed in 400-ml ExpiCHOcells followed by CR3022 purification and SEC on a Superose 6 10/300 GLcolumn. Three separate production runs generated highly consistent SECprofiles for all three constructs, despite the variation of low-m.w.impurities observed for FR and E2p SApNPs. Following CR3022/SECpurification, we obtained on average 0.3-0.4 mg, 0.15-0.25 mg, and0.3-0.35 mg SApNP for S2GΔHR2-5GS-FR, S2GΔHR2-5GS-E2p-LD4-PADRE (orE2p-L4P), and S2GΔHR2-10GS-I3-01v9-LD7-PADRE (or I3-01v9-L7P). Overall,S2GΔHR2-10GS-I3-01v9-L7P appeared to be the best performer in terms ofyield, purity, and stability in production.

The structural integrity of CR3022/SEC-purified SApNPs was characterizedby negative stain EM, which showed well-formed particles in the range of40-60 nm, consistent with the modeling. Spikes could be readilyrecognized on the SApNP surface. Antigenicity of S2GΔHR2-presentingSApNPs was assessed using the same panel of mAbs/NAbs. In ELISA, threeSApNPs showed slightly improved binding to some, but not all, of theantibodies compared to the individual spike. In BLI, we observed a clearcorrelation between peak binding signal and antigen valency, with aranking of E2p/I3-01v9>FR>spike. Display on the two 60-merssignificantly improved antibody binding compared to the 24-mer, FR.

In summary, these large VLP-size SApNPs with 8 or 20 spikes on thesurface provide promising vaccine candidates for in vivo evaluation.

Example 7 SARS-CoV-1/2 Vaccine-Induced Binding Antibody Response

Selected SARS-CoV-1/2 RBD- and spike-based immunogens were evaluated inBALB/c mice to evaluate vaccine-induced antibody response (FIG. 2 ).Groups of five mice were immunized four times with three-week intervals.All vaccine antigens were formulated with AddaVax, an oil-in-wateremulsion adjuvant, except for the I3-01v9 SApNP, which was formulatedwith aluminum phosphate (AP). We first performed a longitudinal analysisof binding antibody response as measured by half maximal effectivedilution (ED₅₀) in the two SARS-CoV-2 RBD vaccine groups. Results fromthe study are shown in FIG. 3 . The RBD SApNP (RBD-5GS-SPY-5GS-FR)elicited significantly higher ED₅₀ titers than the scaffolded RBD trimer(RBD-5GS-1TD0) at w2 and w5, irrespective of the coating antigen, andshowed a P value of 0.0009 at w8 when RBD was coated. Compared to thestabilized spike (S2GΔHR2-5GS-1TD0), the RBD SApNP elicitedsignificantly higher ED₅₀ titers against RBD at w2, w5, and w8,demonstrating a strong “epitope-focusing” effect. Mouse sera bound theSARS-CoV-1 spike with lower ED₅₀ titers than the SARS-CoV-2 spike butwith similar patterns). We then performed a longitudinal analysis ofbinding antibody response induced by two SARS-CoV-2 spikes,S2P_(ECTO)-5GS-1TD0 and S2GΔHR2-5GS-1TD0, and three SApNPs eachdisplaying 8 or 20 S2GΔHR2 spikes. The S2GΔHR2 spike elicited 2˜3-foldhigher average ED₅₀ titers than the S2P_(ECTO) spike irrespective of thecoating antigen, showing greater immunogenicity (of note, to facilitatea fair comparison, mouse sera from the two spike groups were testedagainst their respective spikes).

Three SApNPs exhibited different temporal patterns depending on thecoating antigen. When spike was used as the coating antigen, the I3-01v9group showed a steady increase in average ED₅₀ titer over time. ThisSApNP yielded the highest average ED₅₀ titer at two time points, w2 andw8, and significantly outperformed the S2P_(ECTO) spike at all timepoints. The smaller FR exhibited a similar temporal pattern with loweraverage ED₅₀ titers, which are still significantly higher than that ofthe S2P_(ECTO) group. Among the three SApNPs, E2p registered the lowestaverage ED₅₀ titer at w2 and reached the highest at w5, which decreasedslightly at w8. In terms of RBD-specific response, the five groupsshowed a clear ranking based on their average ED₅₀ titers, whichremained consistent across time points. At w2, I3-01v9 elicited anaverage ED₅₀ titer of 175, whereas all other spike-based vaccine groupsshowed little RBD-specific response. At w5 and w8, S2GΔHR2 elicitedhigher ED₅₀ titers (on average by 2-fold) than S2P_(ECTO), while allthree SApNPs outperformed the individual S2GΔHR2 spike with a ranking ofED₅₀ titers correlated with their size (FR<E2p<I3-01v9). Sera reactedwith the SARS-CoV-1 spike similarly, albeit at a lower level.

Lastly, we compared binding antibody responses induced by threeSARS-CoV-1 vaccines—the S2P_(ECTO) spike (S2P_(ECTO)-5GS-1TD0), thescaffolded RBD trimer (RBD-5GS-1TD0), and the RBD SApNP(RBD-5GS-SPY-5GS-FR). Based on the ED₅₀ titers, the SARS-CoV-1S2P_(ECTO) spike appeared to be more immunogenic than the SARS-CoV-2S2GΔHR2 spike, whereas the SARS-COV-1 RBD SApNP was less advantageousthan its SARS-COV-2 counterpart. Serum reactivity with the SARS-CoV-2S2P_(ECTO) spike was observed for all three SARS-CoV-1 vaccine groups.

Our results thus indicate that RBD SApNPs can elicit RBD-specificantibody titers at a similar or higher level compared to the spike.Furthermore, the S2GΔHR2 spike is more immunogenic than the widely usedS2P_(ECTO) spike, in addition to its superior in-vitro properties. Thelarge multilayered E2p and I3-01v9 SApNPs are the best performers amongall the spike-based vaccines, consistent with the findings in ourprevious HIV-1, HCV, and Ebola vaccine studies.

Example 8 SARS-CoV-1/2 Vaccine-Induced NAb Response

One major goal in COVID-19 vaccine development is to generate a potentNAb response that can protect against SARS-CoV-2 infection.Pseudoparticle (SARS-CoV-1/2-pp) neutralization assays were used toevaluate serum NAb responses elicited by different vaccine candidates.As indicated by the results shown in FIG. 4 , we first performed alongitudinal analysis of NAb response as measured by half maximalinhibitory dilution (ID₅₀) in the two SARS-CoV-2 RBD vaccine groups. TheRBD SApNP elicited low titers of NAb response against autologousSARS-CoV-2 at as early as w2 and retained its advantage at the two latertime points, suggesting that such RBD SApNP vaccines can elicit a rapidNAb response upon vaccination. The scaffolded RBD trimer group showedthe lowest average ID₅₀ titer at w5 but a NAb response comparable tothat induced by the stabilized S2GΔHR2 spike at w8. A somewhat differentpattern was observed in the SARS-CoV-1-pp assay. At the first timepoint, w2, no vaccine groups showed any detectable heterologous NAbresponse. At w5 and w8, the S2GΔHR2 spike elicited a more potentSARS-CoV-1 NAb response than both RBD-based vaccines, suggesting thatnon-RBD epitopes may contribute to the cross-neutralization.

We then performed a longitudinal analysis of NAb responses induced byfive spike-based vaccines. In terms of autologous neutralization, nospike-based vaccine elicited any SARS-CoV-2-pp NAb response at w2 afterthe first injection. But a consistent pattern was observed for serumneutralization at w5 and w8: the S2P_(ECTO) spike used in almost allvaccine candidates currently in human trials showed the lowest averageID₅₀ titers, 879 and 2481 at w5 and w8, respectively; the newly designedS2GΔHR2 spike induced a stronger NAb response than the S2P_(ECTO) spikewith 2.8-6.7-fold higher average ID₅₀ titers, confirming the beneficialeffect of the 2P-to-2G substitution and deletion of the HR2 stalk; amongthe three SApNPs, E2p was the best performer at w5, showing an averageID₅₀ titer of 8493 that is 9.7-fold higher than S2P_(ECTO) and 1.4-foldhigher than S2GΔHR2, while I3-01v9 showed the most potent NAb responseat w8 with an average ID₅₀ titer of 17351 that is 7-fold and 2.5-foldhigher than S2P_(ECTO) and S2GΔHR2, respectively. A similar temporalpattern of NAb response was observed in the heterologous SARS-CoV-1-ppassay. It is worth noting that the I3-01v9 SApNP elicited a SARS-CoV-1NAb response with an average ID₅₀ titer of 351 at w2, whereas all othergroups showed no detectable neutralization. Nonetheless, our resultssuggest that the SARS-CoV-2 S2GΔHR2-based vaccines, particularly SApNPs,may provide protection against both SARS-CoV-1/2. Lastly, we performed alongitudinal analysis of NAb responses induced by three SARS-CoV-1vaccines. In the autologous SARS-CoV-1-pp assay, the S2P_(ECTO) spikeand the RBD SApNP induced significantly more potent NAb responses thanthe scaffolded RBD trimer at w2 and w5 and all three vaccine groupsshowed similar ID₅₀ titers at w8. However, heterologous SARS-CoV-2neutralization was below or at the baseline level for three SARS-CoV-1vaccines at w2, w5, and w8.

Our results thus demonstrate the advantage of the S2GΔHR2 spike andS2GΔHR2-presenting SApNPs with respect to the S2P_(ECTO) spike in NAbelicitation. While the SARS-CoV-2 RBD- and S2GΔHR2-presenting SApNPs arecomparable in eliciting SARS-CoV-2-specific NAb response, the latter mayprovide a broader protection against SARS-associated CoVs.

Example 9 T-Cell Response and Vaccine Safety

While the humoral immunity is required to block host-virus interactionand prevent viral infection, the cellular immunity is essential foreliminating infected host cells to control viral infection. Emergingevidence indicates that an early T-cell response, as well as T-cellmemory, is critical for protection against SARS-CoV-2. However, COVID-19vaccines must induce a CD4⁺ T helper 1 (Th1), but not Th2-type, T-cellresponse, as the latter has been linked to vaccine-associatedenhancement of respiratory disease (VAERD). In addition, T follicularhelper cells (Tfh) play an important role in the maturation andproduction of NAbs. Therefore, understanding T-cell response is crucialfor the development of an effective and safe COVID-19 vaccine.

Interferon (IFN)-γ-producing Th1 cells are important for generating anoptimal antibody response and for the induction of cellular immunity toclear viruses. We first examined the impact of various SARS-CoV-2vaccine formulations on the induction of CD4⁺ Th1 responses specific tothe spike protein at w11—two weeks after the fourth immunization, whenmemory T cells had already developed in spleen. Mouse splenocytes fromthe S2P group and two SApNP groups (E2p and I3-01v9) were analyzed byflow cytometry (FC) using naïve samples as a negative control. Resultsfrom the studies are shown in FIG. 5 . I3-01v9 induced approximately1.5- and 2.3-fold higher frequency of IFN-γ-producing CD4⁺ Th1 cellsthan S2P and E2p, respectively. Notably, following re-stimulation withthe respective antigens for as few as 4 hours, both E2p and I3-01v9groups produced ˜2-fold higher frequency of CD107a-producing cytolyticCD4⁺ T cells than the S2P and naïve control groups. IFN-γ/IL-4(interleukin-4) double-positive cells are memory CD4⁺ T cells that haveacquired the ability to produce IL-4 while still retaining the abilityto produce IFN-γ under Th1 conditions. It appeared that I3-01v9 induced3- and 5-fold more IFN-γ/IL-4 double-positive memory CD4⁺ T cells thanS2P and E2p. These results suggest that I3-01v9 can induce both potentCD4⁺ Th1 cells and IFN-γ/IL-4 double-positive memory CD4⁺ T cells.

In addition, I3-01v9 induced more IFN-γ/GM-CSF (granulocyte-macrophagecolony-stimulating factor) double-positive CD8⁺ effector T cells thanS2P and E2p, as shown in FIG. 5 . These data suggest that protectiveCD8⁺ T cell responses were also generated in mice immunized with theI3-01v9 SApNP. Of note, CD8⁺ T cells derived from mice immunized withI3-01v9, rather than those with S2P and E2p, acquired the ability torapidly produce IFN-γ upon antigen re-stimulation, suggesting thegeneration of I3-01v9-responsive effector/memory T cells. Together, ourfindings indicate that the S2GΔHR2 I3-01v9 SApNP can induce potentT-cell responses in mice consisting of CD4⁺ Th1 cells, IFN-γ/IL-4double-positive memory CD4⁺ T cells, and CD8⁺ T cells, thus providingprotective cellular immunity required for an effective vaccine againstSARS-CoV-2.

Example 10 Some Exemplified Methods

Design, expression and purification of SARS-CoV-2 RBD and spikeantigens: The spike (S) genes of the SARS-CoV-1 isolate Tor2 (GenBankaccession #: NC_004718) and the SARS-CoV-2 isolate Wuhan-Hu-1 (GenBankaccession #: MN908947) were used to design all the RBD and spikeconstructs following codon-optimization for expression in mammaliancells. The RBD sequence is defined as P317-D518 and P330-N532 forSARS-CoV-1 and 2, respectively. The S_(ECTO) sequence is defined asM1-Q1190 and M1-Q1208 for SARS-CoV-1 and 2, respectively. To remove theS1/S2 cleavage site, an R667G mutation and a ⁶⁸²GSAGSV⁶⁸⁷ (SEQ ID NO:18)modification were introduced in the SARS-CoV-1 and 2 spikes,respectively. The 2P (or 2G) mutation was made to K968/V969 andK986/V987 in the SARS-CoV-1 and 2 spikes, respectively. The SARS-CoV-2C-terminal region (E1150-Q1208) containing the HR2 stalk was removedfrom S2G_(ECTO), resulting in an HR2-deleted spike construct termedS2GΔHR2. The viral capsid protein SHP (PDB: 1TD0) was used as atrimerization motif in spike constructs for immunization, whereas thefoldon domain from the bacteriophage T4 fibritin (PDB: 1RFO) was used incoating spike antigens for ELISA to mask the 1TD0-derived antibodyresponse. All constructs were transiently expressed in ExpiCHO cells(Thermo Fisher). Briefly, ExpiCHO cells were thawed and incubated withExpiCHO™ Expression Medium (Thermo Fisher) in a shaker incubator at 37°C., 135 rpm and 8% CO₂. When the cells reached a density of 10×10⁶ ml⁻¹,ExpiCHO™ Expression Medium was added to reduce cell density to 6×10⁶ml⁻¹ for transfection. The ExpiFectamine™ CHO/plasmid DNA complexes wereprepared for 100-ml transfection in ExpiCHO cells following themanufacturer's instructions. For a given construct, 100 μg of plasmidand 320 μl of ExpiFectamine™ CHO reagent were mixed in 7.7 ml of coldOptiPRO™ medium (Thermo Fisher). After the first feed on day one,ExpiCHO cells were cultured in a shaker incubator at 33° C., 115 rpm and8% CO₂ following the Max Titer protocol with an additional feed on dayfive (Thermo Fisher). Culture supernatants were harvested 13 to 14 daysafter transfection, clarified by centrifugation at 4000 rpm for 25 min,and filtered using a 0.45 μm filter (Thermo Fisher). The CR3022 antibodycolumn was used to extract SARS-CoV-1/2 antigens from the supernatants,which was followed by SEC on a Superdex 200 10/300 GL column (forscaffolded RBD trimer) or a Superose 6 10/300 GL column (forRBD-SPY-NPs, spikes, and spike-presenting NPs). For comparison,His-tagged S_(ECTO)-5GS-1TD0 spike protein was extracted from thesupernatants using an immobilized Ni Sepharose™ Excel column (GEHealthcare) and eluted with 500 mM Imidazole prior to SEC. Proteinconcentration was determined using UV₂₈₀ absorbance with theoreticalextinction coefficients.

Blue native polyacrylamide gel electrophoresis: SARS-CoV-2 spikes andspike-presenting NPs were analyzed by blue native polyacrylamide gelelectrophoresis (BN-PAGE) and stained with Coomassie blue. The proteinswere mixed with sample buffer and G250 loading dye and added to a 4-12%Bis-Tris NativePAGE™ gel (Life Technologies). BN-PAGE gels were run for2 to 2.5 hours at 150 V using the NativePAGE™ running buffer (LifeTechnologies) according to the manufacturer's instructions.

Enzyme-linked immunosorbent assay: Each well of a Costar™ 96-well assayplate (Corning) was first coated with 50 μl PBS containing 0.2 μg of theappropriate antigens. The plates were incubated overnight at 4° C., andthen washed five times with wash buffer containing PBS and 0.05% (v/v)Tween 20. Each well was then coated with 150 μl of a blocking bufferconsisting of PBS, 40 mg ml⁻¹ blotting-grade blocker (Bio-Rad), and 5%(v/v) FBS. The plates were incubated with the blocking buffer for 1 hourat room temperature, and then washed five times with wash buffer. Forantigen binding, antibodies were diluted in the blocking buffer to amaximum concentration of 5 μg ml⁻¹ followed by a 10-fold dilutionseries. For each antibody dilution, a total of 50 μl volume was added tothe appropriate wells. For mouse sample analysis, serum or plasma wasdiluted by 20-fold in the blocking buffer and subjected to a 10-folddilution series. For each sample dilution, a total of 50 μl volume wasadded to the wells. Each plate was incubated for 1 hour at roomtemperature, and then washed 5 times with PBS containing 0.05% Tween 20.For antibody binding, a 1:5000 dilution of goat anti-human IgG antibody(Jackson ImmunoResearch Laboratories, Inc), or for mouse sampleanalysis, a 1:3000 dilution of horseradish peroxidase (HRP)-labeled goatanti-mouse IgG antibody (Jackson ImmunoResearch Laboratories), was thenmade in the wash buffer (PBS containing 0.05% Tween 20), with 50 μl ofthis diluted secondary antibody added to each well. The plates wereincubated with the secondary antibody for 1 hour at room temperature,and then washed 5 times with PBS containing 0.05% Tween 20. Finally, thewells were developed with 50 μl of TMB (Life Sciences) for 3-5 minbefore stopping the reaction with 50 μl of 2 N sulfuric acid. Theresulting plate readouts were measured at a wavelength of 450 nm. Ofnote, the w2 serum binding did not reach the plateau (or saturation) toallow for accurate determination of ED₅₀ titers. Nonetheless, the ED₅₀values at w2 were derived by setting the lower/upper constraints ofOD₄₅₀ at 0.0/3.2 to facilitate the comparison of different vaccinegroups at the first time point.

Bio-layer interferometry: The kinetics of SARS-CoV-1/2 vaccine antigens,RBD versus RBD-presenting NPs as well as spike versus spike-presentingNPs, binding to a panel of known antibodies was measured using an OctetRED96 instrument (FortéBio, Pall Life Sciences). All assays wereperformed with agitation set to 1000 rpm in FortéBio 1× kinetic buffer.The final volume for all the solutions was 200 μl per well. Assays wereperformed at 30° C. in solid black 96-well plates (Geiger Bio-One). Forall antigens with the exception of S2GΔHR2-NPs, 5 μg ml⁻¹ of antibody in1× kinetic buffer was loaded onto the surface of anti-human Fc CaptureBiosensors (AHC) for 300 s. For S2GΔHR2-NPs, anti-human Fc QuantitationBiosensors (AHQ) were used. A 60 s biosensor baseline step was appliedprior to the analysis of the association of the antibody on thebiosensor to the antigen in solution for 200 s. A two-fold concentrationgradient of antigen, starting at 950 nM for scaffolded RBD trimers, 37nM for RBD-5GS-SPY-5GS-FR NP, 150 nM for spike trimers, and 9/3.5/3.5 nMfor S2GΔHR2 presented on FR/E2p/I3-01v9 NPs, was used in a titrationseries of six. The dissociation of the interaction was followed for 300s. Correction of baseline drift was performed by subtracting the meanvalue of shifts recorded for a sensor loaded with antibody but notincubated with antigen and for a sensor without antibody but incubatedwith antigen. Octet data were processed by FortéBio's data acquisitionsoftware v.8.1. Experimental data were fitted with the binding equationsdescribing a 2:1 interaction to achieve optimal fitting. Of note,S2GΔHR2 trimer binding was also measured using AHQ to facilitate thecomparison of antibody binding with S2GΔHR2-presenting NPs.

Electron microscopy (EM) assessment of nanoparticle constructs: Theinitial EM analysis of RBD and S2GΔHR2-presenting NPs was conducted atthe Core Microscopy Facility at The Scripps Research Institute. Briefly,NP samples were prepared at the concentration of 0.01 mg/ml.Carbon-coated copper grids (400 mesh) were glow-discharged and 8 μL ofeach sample was adsorbed for 2 min. Excess sample was wicked away andgrids were negatively stained with 2% uranyl formate for 2 min. Excessstain was wicked away and the grids were allowed to dry. Samples wereanalyzed at 80 kV with a Talos L120C transmission electron microscope(Thermo Fisher) and images were acquired with a CETA 16M CMOS camera.

Animal immunization and sample collection: Similar immunizationprotocols have been reported in our previous NP vaccine studies.Briefly, the Institutional Animal Care and Use Committee (IACUC)guidelines were followed with animal subjects tested in the immunizationstudy. Eight-week-old BALB/c mice were purchased from The JacksonLaboratory and housed in ventilated cages in environmentally controlledrooms at The Scripps Research Institute, in compliance with an approvedIACUC protocol and AAALAC (Association for Assessment and Accreditationof Laboratory Animal Care) International guidelines. Mice were immunizedat weeks 0, 3, 6, and 9 with 200 μl of antigen/adjuvant mix containing50 μg of vaccine antigen and 100 μl of adjuvant, AddaVax or Adju-Phos(InvivoGen), via the intraperitoneal (i.p.) route. Blood was collectedtwo weeks after each immunization. All bleeds were performed through theretro-orbital sinus using heparinized capillary tubes into EDTA-coatedtubes. Samples were diluted with an equal volume of PBS and thenoverlaid on 4.5 ml of Ficoll in a 15 ml SepMate™ tube (STEMCELLTechnologies) and spun at 1200 RPM for 10 min at 20° C. to separateplasma and cells. The plasma was heat inactivated at 56° C. for 30 min,spun at 1200 RPM for 10 min, and sterile filtered. The cells were washedonce in PBS and then resuspended in 1 ml of ACK Red Blood Cell lysisbuffer (Lonza). After washing with PBS, peripheral blood mononuclearcells (PBMCs) were resuspended in 2 ml of Bambanker Freezing Media(Lymphotec). Spleens were also harvested and ground against a 70-μm cellstrainer (BD Falcon) to release the splenocytes into a cell suspension.Splenocytes were centrifuged, washed in PBS, treated with 5 ml of ACKlysing buffer (Lonza), and frozen with 3 ml of Bambanker freezing media.Sera were heat inactivated for ELISA binding and pseudovirusneutralization assays.

SARS-CoV-1/2 pseudovirus neutralization assay: Pseudoparticle(SARS-CoV-1/2-pp) neutralization assays were utilized to assess theneutralizing activity of previously reported antibodies andvaccine-induced murine antibody response. SARS-CoV-1/2-pps weregenerated by co-transfection of HEK293T cells with the HIV-1pNL4-3.lucR-E-plasmid (the NIH AIDS reagent program) and the expressionplasmid encoding the S gene of SARS-CoV-1 isolate Tor2 (GenBankaccession #: NC_004718) and the SARS-CoV-2 isolate Wuhan-Hu-1 (GenBankaccession #: MN908947) at a 4:1 ratio by lipofectamine 3000 (ThermoFisher Scientific). After 48 to 72 hours, SARS-CoV-1/2-pps werecollected from the supernatant by centrifugation at 4000 rpm for 10 min,aliquoted, and stored at −80° C. before use. The mAbs at a startingconcentration of 0.1-10 μg/ml, or mouse serum at a starting dilution of100-fold, were mixed with the supernatant containing SARS-CoV-1/2-ppsand incubated for 1 hour at 37° C. in white solid-bottom 96-well plate(Corning). A 3-fold dilution series was used in the assay. TheHEK293T-hACE2 cell line (catalogue #: NR-52511) and the vectorpcDNA3.1(−) containing the SARS-CoV-2 spike gene (catalogue #: NR52420)were obtained from BEI RESOURCES and used in pseudovirus neutralizationassays. Briefly, HEK293T-hACE2 cells at 1×10⁴ were added to each welland the plate was incubated at 37° C. for 48 hours. After incubation,overlying media was removed, and cells were lysed. The fireflyluciferase signal from infected cells was determined using theBright-Glo Luciferase Assay System (Promega) according to themanufacturer's instructions. Data were retrieved from a BioTekmicroplate reader with Gen 5 software, the average backgroundluminescence from a series of uninfected wells was subtracted from eachwell, and neutralization curves were generated using GraphPad Prism8.4.3, in which values from wells were compared against a wellcontaining SARS-CoV-1/2-pp only. The same HIV-1 vectors pseudotyped withthe murine leukemia virus (MLV) Env gene, termed MLV-pps, were producedin HEK293T cells and included in the neutralization assays as a negativecontrol.

Dendritic cell (DC) production: Mouse bone marrow (BM) was cultured inRPMI 1640 medium containing 10% fetal bovine serum and recombinant mouseFlt3L (50 ng/mL) and SCF (10 ng/ml) for 9 days. To induce DC activation,immature DCs were incubated with lipopolysaccharide (LPS, 100 ng/mL),R848 (Resiquimod, 100 ng/mL) or CpG (ODN 1585, 1 μM) overnight, whichactivated Toll-like receptor (TLR)4, TLR7/8 or TLR9 signaling,respectively. Cells were harvested for experiments. pDCs were sorted toisolate CD11c+B220+ cells using FACS cell sorter and magnetic beads(Miltenyi-Biotech, CA).

Antibodies (Abs) and flow cytometry analysis: All antibodies used forimmunofluorescence staining were purchased from eBioscience (San Diego,CA), BioLegend (San Diego, CA) or BD Biosciences (San Jose, CA).Magnetic microbead-conjugated Abs and streptavidin were purchased fromMiltenyi-Biotech (Auburn, CA). Recombinant human IL-2 protein waspurchased from R&D Systems (Minneapolis, MN). Recombinant mouse Flt3ligand (Flt3L) and mouse SCF were purchased from Shenandoah Biotech(Warwick, PA). Cells were stained with appropriate concentrations ofmAbs. Dead cells were excluded using Fixable Viability Dye fromeBioscience (San Diego, CA). Flow cytometry analyses were performedusing LSRII (BD Bioscience, CA) and Canto cytometers (Becton Dickinson,NJ). Cell were sorted on BD FACSAria II (BD Bioscience, CA).

T cell culture and activation: Splenic mononuclear cells from immunizedmice were cultured in the presence of DCs pulsed with or without S2P,E2P and I3-01 in complete IMDM medium containing IL-2 (5.0 ng/ml). Cellswere collected 16 hours later for intracellular cytokine staining andflow cytometric analysis.

Statistics: In antibody titer analysis, comparison of different vaccinegroups was performed in GraphPad Prism 8.4.3 using the two-tailedunpaired Student's t test. In the T cell analysis, comparison of meanswas done using the two-tailed unpaired Student's t test, ANOVA and thenpost-hoc t test. P values of 0.05 or less were considered significant.

The invention thus has been disclosed broadly and illustrated inreference to representative embodiments described above. It isunderstood that various modifications can be made to the presentinvention without departing from the spirit and scope thereof.

It is further noted that all publications, sequence accession numbers,patents and patent applications cited herein are hereby expresslyincorporated by reference in their entirety and for all purposes as ifeach is individually so denoted. Definitions that are contained in textincorporated by reference are excluded to the extent that theycontradict definitions in this disclosure.

What is claimed is:
 1. An engineered immunogen polypeptide derived fromthe spike (S) protein of a coronavirus, comprising an altered soluble Ssequence that has modifications relative to wildtype soluble S sequenceof the coronavirus that stabilize the prefusion S structure, wherein themodifications comprise (a) a mutation that inactivates the S1/S2cleavage site, (b) a mutation in the turn region between the heptadrepeat 1 (HR1) region and the central helix (CH) region that preventsHR1 and CH to form a straight helix during fusion, and (c) a truncationof the heptad repeat 2 region (HR2).
 2. The immunogen polypeptide ofclaim 1, wherein the coronavirus is SARS-CoV-2, wherein the mutationinactivating S1/S2 cleavage site comprises substitution of ⁶⁸²RRAR⁶⁸⁵(SEQ ID NO:19) with GSAG (SEQ ID NO:20), and the mutation in the turnregion comprises double mutation K986G/V987G, K986P/V987P, K986G/V987Por K986P/V987G, wherein the amino acid numbering is based on SARS-CoV-2Wuhan-Hu-1 isolate.
 3. The immunogen polypeptide of claim 2, wherein thewildtype soluble S sequence comprises SEQ ID NO:14, or a variant with atleast 99% sequence identity.
 4. The immunogen polypeptide of claim 3,wherein the truncated HR2 comprises SEQ ID NO:9.
 5. The immunogenpolypeptide of claim 4, further comprising a N-terminal leader sequenceshown in SEQ ID NO:15.
 6. The immunogen polypeptide of claim 4, furthercomprising a trimerization motif at the C-terminus.
 7. The immunogenpolypeptide of claim 6, wherein the trimerization motif comprises foldonset forth in SEQ ID NO:26 or viral capsid protein SHP set forth in SEQID NO:27.
 8. The immunogen polypeptide of claim 4, comprising thesequence shown in any one of SEQ ID NOs:32-37, or a variant thereof withat least 99% sequence identity.
 9. The immunogen polypeptide of claim 1,further comprising the subunit sequence of a self-assemblingnanoparticle that is fused to the C-terminus of the altered soluble Ssequence.
 10. A polynucleotide sequence that encodes the immunogenpolypeptide of claim
 2. 11. The polynucleotide sequence of claim 10,wherein the immunogen polypeptide further comprises a N-terminal leader.12. The polynucleotide sequence of claim 10, encoding a fusionpolypeptide comprising the immunogen polypeptide that is fused at itsC-terminus to the N-terminus of the subunit sequence of aself-assembling nanoparticle.
 13. A coronavirus immunogenic composition,comprising the immunogen polypeptide of claim 1 that is displayed on thesurface of a self-assembling nanoparticle.
 14. The immunogeniccomposition of claim 13, wherein the self-assembling nanoparticlecomprises a trimeric sequence, and wherein C-terminus of the immunogenpolypeptide is fused to N-terminus of the subunit sequence of thenanoparticle.
 15. The immunogenic composition of claim 13, wherein theself-assembling nanoparticle comprises I3-01, E2p or ferritin.
 16. Theimmunogenic composition of claim 13, further comprising a locking domainand/or a T-cell epitope that is fused to the C-terminus of thenanoparticle subunit sequence.
 17. The immunogenic composition of claim13, comprising: (1) a polypeptide sequence containing from N terminus toC terminus (a) an engineered SARS-CoV-2 spike polypeptide, a GS linkersequence, and nanoparticle sequence I3-01v9, (b) an engineeredSARS-CoV-2 spike polypeptide, a GS linker sequence, and nanoparticlesequence E2p, or (c) an engineered SARS-CoV-2 spike polypeptide, a GSlinker sequence, and nanoparticle sequence ferritin; or (2) a variant ofthe polypeptide sequence with at least 99% sequence identity.
 18. Theimmunogenic composition of claim 17, wherein the engineered SARS-CoV-2spike immunogen polypeptide comprises (a) substitution of the S1/S2cleavage site ⁶⁸²RRAR⁶⁸⁵ with GSAG, (b) double mutations K986G/V987G inthe turn region, and (c) truncation of HR2 set forth in SEQ ID NO:9 atthe C-terminus of the wildtype soluble S sequence; wherein the aminoacid numbering is based on SARS-CoV-2 Wuhan-Hu-1 isolate.
 19. Theimmunogenic composition of claim 17, wherein the engineered SARS-CoV-2spike immunogen polypeptide comprises SEQ ID NO:33 or 34, or a variantthereof with at least 99% sequence identity.
 20. The immunogeniccomposition of claim 19, comprising (1) a polypeptide sequencecontaining from N terminus to C terminus (a) the engineered SARS-CoV-2spike polypeptide shown in SEQ ID NO:33, linker sequence shown in SEQ IDNO:22, nanoparticle sequence shown in SEQ ID NO:23, locking domain shownin SEQ ID NO:29, and T cell epitope shown in SEQ ID NO:30, (b) theengineered SARS-CoV-2 spike polypeptide shown in SEQ ID NO:33, linkersequence shown in SEQ ID NO:21, nanoparticle subunit sequence shown inSEQ ID NO:24, locking domain shown in SEQ ID NO:28, and T cell epitopeshown in SEQ ID NO:30, or (c) the engineered SARS-CoV-2 spikepolypeptide shown in SEQ ID NO:33, linker sequence shown in SEQ IDNO:21, nanoparticle sequence shown in SEQ ID NO:25; or (2) a variant ofthe polypeptide sequence with at least 99% sequence identity.
 21. Theimmunogenic composition of claim 19, comprising the sequence shown inany one of SEQ ID NOs:38-40, or a variant thereof with at least 99%sequence identity.
 22. A polynucleotide sequence that encodes apolypeptide sequence, wherein the polypeptide sequence comprises theimmunogen polypeptide fused to the subunit sequence of the nanoparticleas set forth in claim
 14. 23. A pharmaceutical composition, comprisingthe immunogenic composition of claim 13, and a pharmaceuticallyacceptable carrier.
 24. A method for inhibiting a SARS-CoV-2 infectionin a subject, comprising administering to the subject a pharmaceuticallyeffective amount of the immunogenic composition of claim 13.